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(54) Title: MODIFIED PEPTIDES AS THERAPEUTIC AGENTS 
(57) Abstract 

The present invention concerns fusion of Fc domains with biologically active peptides and a process for preparing pharmaceutical 
agents using biologically active peptides. In this invention, pharmacologically active compounds are prepared by a process comprising: a) 
selecting at least one peptide that modulates the activity of a protein of interest; and b) preparing a pharmacologic agent comprising an Fc 
domain covalently linked to at least one amino acid of the selected peptide. Linkage to the vehicle increases the half-life of the peptide, 
which otherwise would be quickly degraded in vivo. The preferred vehicle is an Fc domain. The peptide is preferably selected by phage 
display, £. coli display, ribosome display, RNA-peptide screening, or chemical-peptide screening. 
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Modified Peptides as Therapeutic Agents 
Background of the Invention 

Recombinant proteins are an emerging class of therapeutic agents. 
5 Such recombinant therapeutics have engendered advances in protein 
formulation and chemical modification. Such modifications can protect 
therapeutic proteins, primarily by blocking their exposure to proteolytic 
enzymes. Protein modifications may also increase the therapeutic 
protein's stability, circulation time, and biological activity. A review 

1 0 article describing protein modification and fusion proteins is Francis 
(1992), Focus on Growth Factors 3:4-10 (Mediscript, London), which is 
hereby incorporated by reference. 

One useful modification is combination with the "Fc" domain of an 
antibody. Antibodies comprise two functionally independent parts, a 

1 5 variable domain known as "Fab", which binds antigen, and a constant 
domain known as "Fc", which links to such effector functions as 
complement activation and attack by phagocytic cells. An Fc has a long 
serum half-life, whereas an Fab is short-lived. Capon etaL (1989), Nature 
337: 525-31. When constructed together with a therapeutic protein, an Fc 

2 0 domain can provide longer half-lif e or incorporate such functions as Fc 
receptor binding, protein A binding, complement fixation and perhaps 
even placental transfer. Id. Table 1 summarizes use of Fc fusions known in 
the art. 
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Table 1— Fc fusion with therapeutic proteins 



Form of Fc 


Fusion 
partner 


Therapeutic 
implications 


Reference 


igGi 


N-terminus of 
CD30-L 


Hodgkin's disease; 
anaplastic lymphoma; T- 

ppII Ipjikpmia 


U.S. Patent No. 
5,480,981 


Murine Fcy2a 


IL-10 


anti-inflammatory; 

tranenlant roiftptinn 


Zheng fiLal. (1995), *L 
Immunol 154:5590-600 


lgG1 


TNF receptor 


septic shock 


Fisher eta!. (1996), H 
Ennl J Mad 334" 1697- 

1702; Van Zee, K. et al. 
J. Immunol. 156: 

2221-30 


IgG, IgA, 
IgM, or IgE 
(excluding 
the first 
domain) 


I rir recepior 


inflflmmatinn autoimmune 

li lllalltlliaiiui 1, ouiwn • n nunc 

disorders 


U.S. Pat. No. 5,808,029, 
issued September 1 5, 
1998 


lgG1 


CD4 receptor 


AIDS 


Capon et air (1989), 
Nature 337: 525-31 


lgG1, 
IPG3 


N-terminus 
of IL-2 


anti-cancer, antiviral 


Harvill et al. (1995), 
Immunotech. 1:95-105 


IgGi 


C-terminus of 
OPG 


osteoarthritis; 
bone density^ 


WO 97/23614, published 
July 3, 1997 


IgGi 


N-terminus of 
leptin 


anti-obesity 


PCT/US 97/23183, filed 
December 11, 1997 


Human Ig 
Cy1 


CTLA-4 


autoimmune disorders 


Linsley (1991), 
Med. 174:561-9 



A much different approach to development of therapeutic agents is 
peptide library screening. The interaction of a protein ligand with its 
5 receptor often takes place at a relatively large interface. However, as 

demonstrated for human growth hormone and its receptor, only a few key 
residues at the interface contribute to most of the binding energy. 
Clackson etaL (1995), Science 267: 383-6. The bulk of the protein ligand 
merely displays the binding epitopes in the right topology or serves 
1 0 functions unrelated to binding. Thus, molecules of only "peptide" length 
(2 to 40 amino acids) can bind to the receptor protein of a given large 
protein ligand. Such peptides may mimic the bioactivity oTtheiarge 
protein ligand ("peptide agonists") or, through competitive binding, 
inhibit the bioactivity of the large protein ligand ("peptide antagonists"). 



WO 00/24782 



PCT/US99/25044 



Phage display peptide libraries have emerged as a powerful 
method in identifying such peptide agonists and antagonists. See, for 
example, Scott etal. (1990), Science 249: 386; Devlin etal. (1990), Science 
249: 404; U.S. Pat. No. 5,223,409, issued June 29, 1993; U.S. Pat. No. 
5 5,733,731, issued March 31, 1998; U.S. Pat. No. 5,498,530, issued March 12, 
1996; U.S. Pat. No. 5,432,018, issued July 11, 1995; U.S. Pat. No. 5,338,665, 
issued August 16, 1994; U.S. Pat. No. 5,922,545, issued July 13, 1999; WO 
96/40987, published December 19, 1996; and WO 98/15833, published 
April 16, 1998 (each of which is incorporated by reference). In such 

1 0 libraries, random peptide sequences are displayed by fusion with coat 
proteins of filamentous phage. Typically, the displayed peptides are 
affinity-eluted against an antibody-immobilized extracellular domain of a 
receptor. The retained phages may be enriched by successive rounds of 
affinity purification and repropagation. The best binding peptides may be 

1 5 sequenced to identify key residues within one or more structurally related 
families of peptides. See, e.g., Cwirla etal (1997), Science 276: 1696-9, in 
which two distinct families were identified. The peptide sequences may 
also suggest which residues may be safely replaced by alanine scanning or 
by mutagenesis at the DNA level. Mutagenesis libraries may be created 

2 0 and screened to further optimize the sequence of the best binders. 
Lowman (1997), Ann. Rev. Biophvs. Biomol. Struct. 26: 401-24. 

Structural analysis of protein-protein interaction may also be used 
to suggest peptides that mimic the binding activity of large protein 
ligands. In such an analysis, the crystal structure may suggest the identity 

25 and relative orientation of critical residues of the large protein ligand, 
from which a peptide may be designed. See, e.g., Takasaki etal. (1997), 
Nature Biotech. 15: 1266-70. These analytical methods may-alsabe used to . 
investigate the interaction between a receptor protein and peptides 
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selected by phage display, which may suggest further modification of the 
peptides to increase binding affinity. - - ... . 

Other methods compete with phage display in peptide research. A 
peptide library can be fused to the carboxyl terminus of the lac repressor 
5 and expressed in E. coli . Another E. coli -based method allows display on 
the cell's outer membrane by fusion with a peptidoglycan-associated 
lipoprotein (PAL). Hereinafter, these and related methods are collectively 
referred to as " E. coli display." In another method, translation of random 
RN A is halted prior to ribosome release, resulting in a library of 

1 0 polypeptides with their associated RNA still attached. Hereinafter, this 
and related methods are collectively referred to as "ribosome display." 
Other methods employ chemical linkage of peptides to RNA; see, for 
example, Roberts & Szostak (1997), Proc. Natl. Aca d. Sci. USA. 94: 12297- 
303. Hereinafter, this and related methods are collectively referred to as 

1 5 "RN A-peptide screening." Chemically derived peptide libraries have been 
developed in which peptides are immobilized on stable, non-biological 
materials, such as polyethylene rods or solvent-permeable resins. Another 
chemically derived peptide library uses photolithography to scan peptides 
immobilized on glass slides. Hereinafter, these and related methods are 

2 0 collectively referred to as "chemical-peptide screening." Chemical-peptide 
screening may be advantageous in that it allows use of D-amino acids and 
other unnatural analogues, as well as non-peptide elements. Both 
biological and chemical methods are reviewed in Wells & Lowman (1992), 
Curr. Qpin. Biotechnol. 3: 355-62. 

2 5 Conceptually, one may discover peptide mimetics of any protein 

using phage display and the other methods mentioned above. These 
methods have been used for epitope mapping, for identification of critical _ 
amino acids in protein-protein interactions, and as leads for the discovery 
of new therapeutic agents. E.g., Cortese etaL (1996), Curr. Qpin. Biotech. 7: 
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10 



616-21. Peptide libraries are now being used most often in immunological 
studies, such as epitope mapping. Kreeger (1996), The Scientist 10(13): 19- 

20. 

Of particular interest here is use of peptide libraries and other 
techniques in the discovery of pharmacologically active peptides. A 
number of such peptides identified in the art are summarized in Table 2. 
The peptides are described in the listed publications, each of which is 
hereby incorporated by reference. The pharmacologic activity of the 
peptides is described, and in many instances is followed by a shorthand 
term therefor in parentheses. Some of these peptides have been modified 
(e.g., to form C-terminally cross-linked dimers). Typically, peptide 
libraries were screened for binding to a receptor for a pharmacologically 
active protein (e.g., EPO receptor). In at least one instance (CTLA4), the 
peptide library was screened for binding to a monclonal antibody. 
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Table 2— Pharmacologically active peptides 



Form of 
peptide 

intrapeptide 
disulfide- 
bonded 



C-terminally 
cross-linked 
dimer 



linear 



linear 



C-terminally 
cross-linked 
dimer 
disulfide- 
linked dimer 



alkylene- 
linked dimer 



Binding 
partner/ 
protein of 
interest* 
EPO receptor 



linear 



EPO receptor 



EPO receptor 



c-Mp! 



c-MpI 



IL-1 receptor 



Pharmacologic 
activity 

EPO-mimetic 



Reference 



Wrighton fiLal. (1996), 
Science 273: 458-63; 
U.S. Pat. No. 5,773,569, 
issued June 30, 1998 to 
Wrinhton et al. 



EPO-mimetic 



Uvnah eLal. (1996), 
5cifiDCfi273: 464-71; 
Wrighton eLal. (1997), 
j^ire Biotechnology 15: 
1261-5; International 
patent application WO 
96/40772, published 
Dec. 19, 1996 



EPO-mimetic 



Naranda fiLal. (1999), 
PrOC, Natl. Acad. Sci. 

US^. 96:7569-74 



TPO-mimetic 



TPO-mimetic 



stimulation of 
hematopoiesis 
("G-CSF-mimetic") 



CwirlafiLfll.(1997) 
Science 276: 1696-9; 
U.S. Pat. No. 5,869,451 , 
issued Feb. 9,1999; U.S. 
Pat. No. 5,932,946, 
issued Aug. 3, 1999 
CwirlasLal. (1997), 

Science 276: 1696-9 



Paukovits eLal. (1984), 
Hnppe-Sevlers Z. 
Physiol. Chem . 365: 303- 
11;Laerum fiLal. (1988), 
Exp. Hemat.16: 274-80 



G-CSF-mimetic 



inflammatory and 
autoimmune diseases 
(IL-1 antagonist" or 
1L-1ra-mimetic") 



Bhatnagar flLal- (1996), 
J. Med. Chem . 39: 3814- 
9; Cuthbertson eLal. 
(1997), .1 Med. Chem. 
40: 2876-82; King fiLal. 
(1991), Exp. Hematol. 
19:481; King flLfll. 

(i995).fiicQd86 (Suppi. 

1)" 309a 
U.S. Pat. No. 5,608,035; 
U.S. Pat. No. 5,786,331 ; 
U.S-Pat. No. 5,880,096; 
Yanofsky fiLal. (1996), 



• The protein listed in this column may be bound by the associated W^JfJJ?. 
receptor IL-1 receptor) or mimicked by the associated peptide. The references listed for 
each clarify whether the molecule is bound by or mimicked by the peptides. 
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Proc, Natl, Acad, Sci. 93: 

7381-6; Akeson et al . 
(1996), J. Biol. Chem . 
271:30517-23; 
Wiekzorek fiLal. (1997), 

Pol, J. Pharmacol. 49: 

107-17; Yanofsky (1996), 
PNAs, 93:7381-7386. 



linear 



Facteur 
thymique 
serique (FTS) 



stimulation of 
lymphocytes 
("FTS-mimetic") 



inagaki-Ohara et al . 
(1996), Cellular Immunol. 
1 71 : 30-40; Yoshida 
(1984), IflLA 
ImmunopharmacoL 

6:141-6. 



intrapeptide 
disulfide 
bonded 



CTLA4 MAb 



CTLA4-mimetic 



FukumotOfiLaL(1998), 
Nature Biotech. 16: 267- 
70 



exocyclic TNF-a receptor 



TNF-a antagonist 



Takasaki fiLal. (1997), 
Nature Biotech . 15:1266- 
70; WO 98/53842, 
published December 3, 
1998 



linear 



TNF-a receptor 



TNF-a antagonist 



Chirinos-Rojas ( ), J* 
Imm.. 5621-5626. 



intrapeptide 
disulfide 
bonded 



C3b 



inhibition of complement 
activation; autoimmune 
diseases 
("C3b-antagonisn 



Sahu et al . (1996), J* 
Immunol. 157: 884-91; 
Morikis fiLal. (1998), 
PfWlngd- 7:619-27 



linear 



vinculin 



cell adhesion processes'— 
cell growth, differentiation, 
wound healing, tumor 
metastasis ("vinculin 
binding") 



Adey fiLfll- (1997), 
Piochem. J. 324: 523-8 



linear 



C4 binding 
protein (C4BP) 



antithrombotic 



Linse et al . (1997),^ 
Biol. Chem . 272:14658- 
65 



linear 



urokinase 
receptor 



processes associated with 
urokinase interaction with 

its receptor (e.g., 
angiogenesis, tumor cell 
invasion and metastasis); 
rUKR antagonist") 



Goodson fiLal- (1994), 

Proc, Natl, Acad, Sci- 91: 

7129-33; International 
application WO 
97/35969, published 
October 2, 1997 



linear 



Mdm2, Hdm2 



Inhibition of inactivation of 
p53 mediated by Mdm2 or 

hdm2; anti-tumor 
("Mdm/hdm antagonist") 



Picksley fiLal. (1994), 
Oncog ene 9: 2523-9; 
Bottger fiLal. (1997) J, 
Mol. BioL 269: 744-56; 
Bottger fiLal. (1996), 



linear 


p21 WAn . ■ 


anti-tumor by mimicking 
the activity of p21 WAF1 


Bait fiLal- (1 997), Curr. „ . 
Biol. 7:71-80 


linear 


farnesyl 


anti-cancer by preventing 


Gibbs et al. (1994), Cell 



b FTS is a thymic hormone mimicked by the molecule of this invention rather than a 
receptor bound by the molecule of this invention. 

0 
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linear 



linear 



linear 



linear 



linear 



linear 



linear 



linear 



linear, 
cyclized 



linear, 
cyclized- 



transferase 
Ras effector 
domain 



SH2/SH3 
domains 



p16' 



Src, Lyn 



Mast cell 
protease 



SH3 domains 



HBV core 
antigen (HBcAg) 

se lectins 



calmodulin 



integrins 



activation of ras oncogene 
anti-cancer by inhibiting 
biological function of the 
ras oncogene 

anti-cancer by inhibiting 

tumor growth with 
activated tyrosine kinases 

anti-cancer by mimicking 
activity of p16;e.g M 
inhibiting cyclin D-Cdk 
complex (*p1 e^metis") 
inhibition of Mast cell 
activation, IgE-related 

conditions, type I 
hypersensitivity ("Mast 

cell antagonist") 
treatment of inflammatory 
disorders mediated by 
release of tryptase-6 
("Mast cell protease 

inhibitors^ 
treatment of SH3- 
mediated disease states 
("SH3 antagonist") 



treatment of HBV viral 
infections ( U anti-HBV) 

neutrophil adhesion; 
inflammatory diseases 
("selectin antagonist") 



tumor-homing; treatment 
for conditions related to 
integrin-mediated cellular 
events, including platelet 
aggregation, thrombosis, 
wound healing, 
osteoporosis, tissue 
repair, anqjpQene sis (e.t 



77:175-178 
Moodie et al. (1994), 
-Trends Genet 10: 44-48 
Rodriguez et al. (1994), 
Nature 370:527-532 
Pawsonetal (1993), 

Curr. Biol. 3:434-432 
Yuetal. (1994), Cell 
76:933-945 
F4hraeus fiLa!. (1996), 
£U!L_Bial. 6:84-91 



Stauffer fiLal. (1997), 
Biochem . 36: 9388-94 



International application 
WO 98/33812, published 
August 6, 1998 



RicklesfiLal. (1994), 
EMBOJ . 13: 5598-5604; 
Sparks sLal. (1 994), J* 
Biol. Chem . 269: 23853- 
6; Sparks sLal. (1996), 
p m fMatl. Acad. ScL 93: 

1540-4 



Dyson & Muray (1995), 
P r^n, Natl. Acad. Sci . 92: 

2194-8 

Martens sLal. (1 995), 
Sialism. 270: 21 129- 
36; European patent 
application EP0 714 
912, published June 5, 

1996 

Pierce sLal. (1995), 
fuinlfl^ Diversity 1:259- 

65; Dedman sLal- 
(1993), -l BiflL Chem. 
268:23025-30; Adey& 
Kay (1996), Qsn& 169: 

133-4 

International applications 
WO 95/1 471 4, published 
Juris 1, 1995; WO 
97/08203, published 
March 6. 1997; WO 
98/10795, published 
March 19, 1998; WO 
99/24462, published Ma) 
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cyclic, linear 



fibronectin and 
extracellular 
matrix 
components of T 
cells and 
macrophages 



for treatment of cancer), 
and tumor invasion 

["integrin-bindint 

treatment of inflammatory 
and autoimmune 
conditions 



20,1999; Kraft sLal 
(1999). J. Biol. Chem. 
274:1979-1985 
WO 98/09985, published 
March 12,1998 



linear 



somatostatin 
and cortistatin 



linear 



linear or 
cyclic, 
including D- 
amino acids 
linear, cyclic 



linear 



bacteria! 
lipopolysac- 
charide 
pardaxin, mellitin 



treatment or prevention of 
hormone-producing 
tumors, acromegaly, 
giantism, dementia, 
gastric ulcer, tumor 
growth, inhibition of 
hormone secretion, 
modulation of sleep or 

neural activit 
antibiotic; septic shock; 
disorders modulatable by 
CAP37 
antipathogenic 



European patent 
application 0 91 1 393, 
published April 28,1999 



U.S. Pat. No. 5,877,151, 
issued March 2,1999 

WO 97/31019, published 
28 August 1997 




CTLs 



impotence, 
neurodegenerative 
disorders 
cancer 



EP 0 770 624, published 
May 2,1997 



linear 



THF-gamma2 



Burnstein (1988). 
Biochem.. 27:4066-71. 



linear 



Amylin 



Cooper (1987),£iQ£L 

Natl. Acad. ScL 
84:8628-32. 



linear 



cyclic, linear 



Adrenomedullin 



VEGF 



cyclic 



MMP 



H GH fragment 
Echistatin 



anti-angiogenic; cancer, 
rheumatoid arthritis, 
diabetic retinopathy, 
psoriasis ("VEGF 

antagonisr 
inflammation and 
autoimmune disorders; 
tumor growth 
"MMP inhibitor") 

inhibition of platelet 
aggregation 



Kitamura (1993),fiEB£, 
192:553-60. 
Fairbrother (1998), 
BiOCtlfim., 37:1 7754- 

17764. 



Koivunen (1999), Nature 
BiQteCtl., 17:768-774. 



U.S. Pat. No. 5.869,452 

Gan (1988) l4 LBioL 
Chem.. 263:19827-32. „ 



linear 



SLE 
autoantibody 

GD1 alpha 



SLE 



suppression of tumor 
metastasis 
ftndothel ijdcell activation 

9 



WO 96/30057. published 
October 3, 1996 

Ishikawa sLfll. (1998), 
441 (1):20-4 

Blank fft a l (1" 9 )i Proc » 
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beta-2- 
glycoprotein-l 
(B2GPI) 
antibodies 



antiphospholipid 
syndrome (APS), 
thromboembolic 

phenomena, 
thrombocytopenia, and 
recurrent fetal loss 



Njfll, A/fflfl Rd- USA 96: 
5164-8 



linear 



T Cell Receptor 
beta chain 



diabetes 



WO 96/1 121 4, published 
April 18, 1996 



Peptides identified by peptide library screening have been regarded 
as "leads" in development of therapeutic agents rather than as therapeutic 
agents themselves. Like other proteins and peptides, they would be 
5 rapidly removed in vivo either by renal filtration, cellular clearance 

mechanisms in the reticuloendothelial system, or proteolytic degradation. 
Francis (1992), Focus ™ Onwth Factors 3: 4-11. As a result, the art 
presently uses the identified peptides to validate drug targets or as 
scaffolds for design of organic compounds that might not have been as 

1 o easily or as quickly identified through chemical library screening. 

Lowman (1997), *™ *py Bioohvs Biomol. Struct . 26: 401-24; Kay etal. 
(1998), nni ff Disc. Today 3: 370-8. The art would benefit from a process by 
which such peptides could more readily yield therapeutic agents. 

Summary of the Invention 

1 5 The present invention concerns a process by which the inyjvo half- 

life of one or more biologically active peptides is increased by fusion with 
a vehicle. In this invention, pharmacologically active compounds are 
prepared by a process comprising: 

a) selecting at least one peptide that modulates the activity of a 
20 protein of interest; and 

b) preparing a pharmacologic agent comprising at least one 
vehicle covalently linked to at least one amino acid sequence 
of the selected peptide. 

The preferred vehicle is an Fc domain. The peptides screened in step (a) 
25 are preferably expressed in a phage display library. The vehicle and the 

Id 
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peptide may be linked through the N- or C-terminus of the peptide or the 
vehicle, as described further below. Derivatives of the above compounds 
(described below) are also encompassed by this invention. 

The compounds of this invention may be prepared by standard 
5 synthetic methods, recombinant DNA techniques, or any other methods of 
preparing peptides and fusion proteins. Compounds of this invention that 
encompass non-peptide portions may be synthesized by standard organic 
chemistry reactions, in addition to standard peptide chemistry reactions 
when applicable. 

10 The primary use contemplated is as therapeutic or prophylactic 

agents. The vehicle-linked peptide may have activity comparable to— or 
even greater than— the natural ligand mimicked by the peptide. In 
addition, certain natural ligand-based therapeutic agents might induce 
antibodies against the patient's own endogenous ligand; the vehicle-linked 

1 5 peptide avoids this pitfall by having litde or typically no sequence identity 

with the natural ligand. 

Although mostly contemplated as therapeutic agents, compounds 
of this invention may also be useful in screening for such agents. For 
example, one could use an Fc-peptide (e.g., Fc-SH2 domain peptide) in an 
2 0 assay employing anti-Fc coated plates. The vehicle, especially Fc, may 
make insoluble peptides soluble and thus useful in a number of assays. 

The compounds of this invention may be used for therapeutic or 
prophylactic purposes by formulating them with appropriate 
pharmaceutical carrier materials and administering an effective amount to 
2 5 a patient, such as a human (or other mammal) in need thereof. Other 
related aspects are also included in the instant invention. 

Numerous additional aspects and advantages of the present 

invention will become apparent upon consideration of the figures and 
detailed description of the invention. 
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Brief Description of the Figures 
Figure 1 shows a schematic representation of an exemplary process 
of the invention. In this preferred process, the vehicle is an Fc domain, 
which is linked to the peptide covalently by expression from a DNA 
5 construct encoding both the Fc domain and the peptide. As noted in 
Figure 1, the Fc domains spontaneously form a dimer in this process. 

Figure 2 shows exemplary Fc dimers that may be derived from an 
IgGl antibody. "Fc" in the figure represents any of the Fc variants within 
the meaning of "Fc domain" herein. "X 1 " and "X 2 " represent peptides or 
1 0 linker-peptide combinations as defined hereinafter. The specific dimers are 
as follows: 

' A, D: Single disulfide-bonded dimers. IgGl antibodies typically 
have two disulfide bonds at the hinge region between the constant and 
variable domains. The Fc domain in Figures 2A and 2 D may be formed by 
1 5 truncation between the two disulfide bond sites or by substitution of a 
cysteinyl residue with an unreactive residue (e.g., alanyl). In Figure 2A, 
the Fc domain is linked at the amino terminus of the peptides; in 2D, at the 

carboxyl terminus. 

B, E: Doubly disulfide-bonded dimers. This Fc domain may be 
20 formed by truncation of the parent antibody to retain both cysteinyl 
residues in the Fc domain chains or by expression from a construct 
including a sequence encoding such an Fc domain. In Figure 2B, the Fc 
domain is linked at the amino terminus of the peptides; in 2E, at the 

carboxyl terminus. 
25 C, F: Noncovalent dimers. This Fc domain may be formed by 

elimination of the cysteinyl residues by either truncation or substitution. 
One may desire to eliminate the cysteinyl residues to avoia impurities 
formed by reaction of the cysteinyl residue with cysteinyl residues of other 
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proteins present in the host cell. The noncovalent bonding of the Fc 
domains is sufficient to hold together the dimer. 

Other dimers may be formed by using Fc domains derived from different 
types of antibodies (e.g., IgG2, IgM). 
5 Figure 3 shows the structure of preferred compounds of the 

invention that feature tandem repeats of the pharmacologically active 
peptide. Figure 3A shows a single chain molecule and may also represent 
the DNA construct for the molecule. Figure 3B shows a dimer in which the 
linker-peptide portion is present on only one chain of the dimer. Figure 3C 

1 0 shows a dimer having the peptide portion on both chains. The dimer of 
Figure 3C will form spontaneously in certain host cells upon expression of 
a DNA construct encoding the single chain shown in Figure 3A. In other 
host cells, the cells could be placed in conditions favoring formation of 
dimers or the dimers can be formed in vitro . 

1 5 Figure 4 shows exemplary nucleic acid and amino acid sequences 

(SEQ ID NOS: 1 and 2, respectively) of human IgGl Fc mat may be used in 
this invention. 

Figure 5 shows a synthetic scheme for the preparation of PEGylated 

peptide 19 (SEQ ID NO: 3). 
2 0 Figure 6 shows a synthetic scheme for the preparation of PEGylated 

peptide 20 (SEQ ID NO: 4). 

Figure 7 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 5 and 6, respectively) of the molecule identified as "Fc-TMP" in 

Example 2 hereinafter. 
2 5 Figure 8 shows the nucleotide and amino acid sequences (SEQ. ID. 

NOS: 7 and 8, respectively) of the molecule identified as "Fc-TMP-TMP" in 
Example 2 hereinafter. . " 
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Figure 9 shows the nucleotide and amino add sequences (SEQ. ID. 
NOS: 9 and 10, respectively) of the molecule identified as ''TMP-TMP-Fc" 

in Example 2 hereinafter. 

Figure 10 shows the nucleotide and amino acid sequences (SEQ. ID. 
5 NOS: 11 and 12, respectively) of the molecule identified as "TMP-Fc" in 

Example 2 hereinafter. 

Figure 11 shows the number of platelets generated in vivo in 
normal female BDF1 mice treated with one 100 ng/kg bolus injection of 
various compounds, with the terms defined as follows. 
0 PEG-MGDF: 20 kD average molecular weight PEG attached by 

reductive amination to the N-terminal amino group of amino 
acids 1-163 of native human TPO, which is expressed in E. coli 
(so that it is not glycosylated); 
TMP: the TPO-mimetic peptide having the amino add sequence 
5 IEGFTLRQWLAARA (SEQ ID NO: 13); 

TMP-TMP: the TPO-mimetic peptide having the amino add 

» 

sequence ffiGPTLRQWLAARA-GGGGGGGG- 
IEGPTLRQWLAARA (SEQ ID NO: 14); 
PEG-TMP-TMP: the peptide of SEQ ID NO: 14, wherein the PEG 
: o group is a 5 kD average molecular weight PEG attached as 

shown in Figure 6; 
Fc-TMP-TMP: the compound of SEQ ID NO: 8 (Figure 8) dimerized 
with an identical second monomer (i.e., Cys residues 7 and 10 
are bound to the corresponding Cys residues in the second 
. 5 monomer to form a dimer, as shown in Figure 2); and 

TMP-TMP-Fc is the compound of SEQ ID NO: 10 (Figure 9) 

dimerized in the same way as TMP-TMP-Fc except that the Fc . • 
domain is attached at the C-terminal end rather than the N- 
terminal end of the TMP-TMP peptide. 
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Figure 12 shows the number of platelets generated in vivo in 
normal BDF1 mice treated with various compounds delivered via 
implanted osmotic pumps over a 7-day period. The compounds are as 

defined for Figure 7. 
5 Figure 13 shows the nucleotide and amino acid sequences (SEQ. ID. 

NOS: 15 and 16, respectively) of the molecule identified as "Fc-EMP" in 

Example 3 hereinafter. 

Figure 14 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 17 and 18, respectively) of the molecule identified as "EMP-Fc" in 

1 0 Example 3 hereinafter. 

Figure 15 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:19 and 20, respectively) of the molecule identified as "EMP-EMP-Fc" 
in Example 3 hereinafter. 

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID 
1 5 NOS: 21 and 22, respectively) of the molecule identified as "Fc-EMP-EMP" 
in Example 3 hereinafter. 

Figures 17A and 17B show the DNA sequence (SEQ ID NO: 23) 
inserted into pCFM1656 between the unique Aatn (position #4364 in 
pCFM1656) and SacH (position #4585 in pCFM1656) restriction sites to 
2 0 form expression plasmid pAMG21 (ATCC accession no. 98113). 

Figure 18A shows the hemoglobin, red blood cells, and hematocrit 
generated in vivo in normal female BDF1 mice treated with one 100 ng/kg 
bolus injection of various compounds. Figure 18B shows the same results 
with mice treated with 100 ug/kg per day delivered the same dooc by 7- 
2 5 day micro-osmotic pump with the EMPs delivered at 100 yg/kg, rhEPO at 
30U/mouse. (In both experiments, neutrophils, lymphocytes, and platelets 
were unaffected.) In these figures, the terms are defined as follows. 

Fc-EMP: the compound of SEQ ID NO: 16 (Figure 13) dimerized 

with an identical second monomer (i.e., Cys residues 7 and 10 are 
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bound to the corresponding Cys residues in the second monomer to 
form a dimer, as shown in Figure 2); 

EMP-Fc: the compound of SEQ ID NO: 18 (Figure 14) dimerized in 
the same way as Fc-EMP except that the Fc domain is attached at 
5 the C-terminal end rather than the N-terminai end of the EMP 

peptide. 

"EMP-EMP-Fc" refers to a tandem repeat of the same peptide (SEQ 
ID NO: 20) attached to the same Fc domain by the carboxyl 
terminus of the peptides. "Fc-EMP-EMP" refers to the same tandem 
1 0 repeat of the peptide but with the same Fc domain attached at the 

amino terminus of the tandem repeat. All molecules are expressed 
in E. coli and so are not glycosylated. 

Figures 19A and 19B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1055 and 1056) of the Fc-TNF-a inhibitor fusion molecule 
1 5 described in Example 4 hereinafter. 

Figures 20A and 20B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1057 and 1058) of the TNF-a inhibitor-Fc fusion molecule 
described in Example 4 hereinafter. 

Figures 21 A and 21B show the nucleotide and amino acid sequences 
2 0 (SEQ ID NOS: 1059 and 1060) of the Fc-IL-1 antagonist fusion molecule 
described in Example 5 hereinafter. 

Figures 22A and 22B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1061 and 1062) of the IL-1 antagonist-Fc fusion molecule 
described in Example 5 hereinafter. 
2 5 Figures 23 A, 23B, and 23C show the nucleotide and amino acid 

sequences (SEQ ID NOS: 1063 and 1064) of the Fc-VEGF antagonist fusion 
molecule described in Example 6 hereinafter. "~ 



it 
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Figures 24A and 24B show the nucleotide and amino add sequences 
(SEQ ID NOS: 1065 and 1066) of the VEGF antagonist-Fc fusion molecule 
described in Example 6 hereinafter. 

Figures 25A and 25B show the nucleotide and amino acid sequences 
5 (SEQ ID NOS: 1067 and 1068) of the Fc-MMP inhibitor fusion molecule 
described in Example 7 hereinafter. 

Figures 26A and 26B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1069 and 1070) of the MMP inhibitor-Fc fusion molecule 

■ 

described in Example 7 hereinafter. 

1 o Detailed Description of the Invention 

Definition of Terms 

The terms used throughout this specification are defined as follows, 
unless otherwise limited in specific instances. 

The term "comprising" means that a compound may include 
1 5 additional amino acids on either or both of the N- or C- termini of the 
given sequence. Of course, these additional amino acids should not 
significantly interfere with the activity of the compound. 

The term "vehicle 7 ' refers to a molecule that prevents degradation 
and/or increases half-life, reduces toxicity, reduces immunogenicity, or 

2 0 increases biological activity of a therapeutic protein. Exemplary vehicles 

include an Fc domain (which is preferred) as well as a linear polymer (e.g., 
polyethylene glycol (PEG), polylysine, dextran, etc.); a branched-chain 
polymer (see, for example, US. Patent No. 4,289,872 to Denkenwalter et 
ah, issued September 15, 1981; 5,229,490 to Tarn, issued July 20, 1993; WO 
2 5 93/21259 by Frechet etal., published 28 October 1993); a lipid; a 

cholesterol group (such as a steroid); a carbohydrate or oligosaccharide; or 
any natural or synthetic protein, polypeptide or peptide that binds to a 
salvage receptor. Vehicles are further described hereinafter. 



17 
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The term "native Fc" refers to molecule or sequence comprising the 
sequence of a non-antigen-binding fragment resulting from digestion of 
whole antibody, whether in monomeric or multimeric form. The original 
immunoglobulin source of the native Fc is preferably of human origin and 
5 may be any of the immunoglobulins, although IgGl and IgG2 are 

preferred. Native Fc's are made up of monomeric polypeptides that may 
be linked into dimeric or multimeric forms by covalent (i.e., disulfide 
bonds) and non-covalent association. The number of intermolecular 
disulfide bonds between monomeric subunits of native Fc molecules 

1 0 ranges from 1 to 4 depending on class (e.g., IgG, IgA, IgE) or subclass (e.g., 
IgGl, IgG2, IgG3, IgAl, IgGA2). One example of a native Fc is a disulfide- 
bonded dimer resulting from papain digestion of an IgG (see Ellison et al . 
(1982), Nucleic Acids Res . 10: 4071-9). The term "native Fc" as used herein 
is generic to the monomeric, dimeric, and multimeric forms. 

1 5 The term "Fc variant" refers to a molecule or sequence that is 

modified from a native Fc but still comprises a binding site for the salvage 
receptor, FcRn. International applications WO 97/34631 (published 25 
September 1997) and WO 96/32478 describe exemplary Fc variants, as 
well as interaction with the salvage receptor, and are hereby incorporated 

20 by reference. Thus, the term "Fc variant" comprises a molecule or 

sequence that is humanized from a non-human native Fc. Furthermore, a 
native Fc comprises sites that may be removed because they provide 
structural features or biological activity that are not required for the fusion 
molecules of the present invention. Thus, the term "Fc variant" comprises 

25 a molecule or sequence that lacks one or more native Fc sites or residues 
that affect or are involved in (1) disulfide bond formation, (2) 
incompatibility with a selected host cell (3) N-terminal heterogeneity upon , 
expression in a selected host cell, (4) glycosylation, (5) interaction with 
complement, (6) binding to an Fc receptor other than a salvage receptor, or 
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(7) antibody-dependent cellular cytotoxicity (ADCC). Fc variants are 
described in further detail hereinafter. 

The term "Fc domain" encompasses native Fc and Fc variant 
molecules and sequences as defined above. As with Fc variants and native 
5 Fc's, the term "Fc domain" includes molecules in monomeric or 

multimeric form, whether digested from whole antibody or produced by 
other means. 

The term "multimer" as applied to Fc domains or molecules 
comprising Fc domains refers to molecules having two or more 

1 0 polypeptide chains associated covalently, noncovalently, or by both 
covalent and non-covalent interactions. IgG molecules typically form 
dimers; IgM, pentamers; IgD, dimers; and IgA, monomers, dimers, 
trimers, or tetramers. Multimers may be formed by exploiting the 
sequence and resulting activity of the native Ig source of the Fc or by 

1 5 derivatizing (as defined below) such a native Fc. 

The term "dimer" as applied to Fc domains or molecules 
comprising Fc domains refers to molecules having two polypeptide chains 
associated covalently or non-covalently. Thus, exemplary dimers within 
the scope of this invention are as shown in Figure 2. 

2 0 The terms "derivatizing" and "derivative" or "derivatized" 

comprise processes and resulting compounds respectively in which (1) the 
compound has a cyclic portion; for example, cross-linking between 
cysteinyl residues within the compound; (2) the compound is cross-linked 
or has a cross-linking site; for example, the compound has a cysteinyl 

2 5 residue and thus forms cross-linked dimers in culture or in vivo; (3) one or 
more peptidyl linkage is replaced by a non-peptidyl linkage; (4) the N- 
termtnus is replaced by -NRR 1 , NRC(0)R l , -NRCCOOR^NRSCO)^ 1 , - 
NHC(0)NHR, a succinimide group, or substituted or unsubstituted 
benzyloxycarbonyl-NH-, wherein R and R 1 and the ring substituents are 
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as defined hereinafter; (5) the C-terminus is replaced by -C(0)R 2 or -NR 3 R 4 
wherein R 2 ,R 3 and R 4 are as defined hereinafter; and (6) compounds in 
which individual amino acid moieties are modified through treatment 
with agents capable of reacting with selected side chains or terminal 
5 residues. Derivatives are further described hereinafter. 

The term "peptide" refers to molecules of 2 to 40 amino acids, with 
molecules of 3 to 20 amino acids preferred and those of 6 to 15 amino acids 
most preferred. Exemplary peptides may be randomly generated by any 
of the methods cited above, carried in a peptide library (e.g., a phage 

1 0 display library), or derived by digestion of proteins. 

The term "randomized" as used to refer to peptide sequences refers 
to fully random sequences (e.g., selected by phage display methods) and 
sequences in which one or more residues of a naturally occurring molecule 
is replaced by an amino acid residue not appearing in that position in the 

1 5 naturally occurring molecule. Exemplary methods for identifying peptide 
sequences include phage display, E. coli display, ribosome display, RNA- 
peptide screening, chemical screening, and the like. 

The term "pharmacologically active" means that a substance so 
described is determined to have activity that affects a medical parameter 

2 0 (e.g., blood pressure, blood cell count, cholesterol level) or disease state 
(e.g., cancer, autoimmune disorders). Thus, pharmacologically active 
peptides comprise agonistic or mimetic and antagonistic peptides as 

« 

defined below. 

The terms "-mimetic peptide" and "-agonist peptide" refer to a 
2 5 peptide having biological activity comparable to a protein (e.g., EPO, TPO, 
G-CSF) that interacts with a protein of interest. These terms further 
include peptides that indirectly mimic the activity of a protein of interest, 
such as by potentiating the effects of the natural ligand of the protein of 
interest; see, for example, the G-CSF-mimetic peptides listed in Tables 2 
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and 7. Thus, the term "EPO-mimetic peptide" comprises any peptides that 
can be identified or derived as described in Wrighton et al. (1996), Science 
273: 458-63, Naranda etal (1999), Proc. Natl. Acad. Sci. USA 96: 7569-74, 
or any other reference in Table 2 identified as having EPO-mimetic subject 
5 matter. Those of ordinary skill in the art appreciate that each of these 
references enables one to select different peptides than actually disclosed 
therein by following the disclosed procedures with different peptide 
libraries. 

The term "TPO-mimetic peptide" comprises peptides that can be 
1 0 identified or derived as described in Cwirla etal. (1997), Science 276: 1696- 
9 , U.S. Pat Nos. 5,869,451 and 5,932,946 and any other reference in Table 2 
identifed as having TPO-mimetic subject matter, as well as the U.S. patent 
application, "Thrombopoietic Compounds/' filed on even date herewith 
and hereby incorporated by reference. Those of ordinary skill in the art 
1 5 appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 

■ 

procedures with different peptide libraries. 

The term "G-CSF-numetic peptide" comprises any peptides that 
can be identified or described in Paukovits etal . (1984), Ho ppe-Sevlers Z. 

2 0 Phvsiol. Chem . 365: 303-1 1 or any of the references in Table 2 identified as 
having G-CSF-mimetic subject matter. Those of ordinary skill in the art 
appreciate that each of these references enables one to select different 
peptides man actually disclosed therein by following the disclosed 
procedures with different peptide libraries. 

2 5 The term "CTLA4-mimetic peptide" comprises any peptides that 

can be identified or derived as described in Fukumoto etal. (1998), Nature 
Biotech . 16: 267-70. Those of ordinary skill in the art appreeiateJthat each of , 
these references enables one to select different peptides than actually 
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disclosed therein by following the disclosed procedures with different 
peptide libraries. 

The term "-antagonist peptide" or "inhibitor peptide" refers to a 
peptide that blocks or in some way interferes with the biological activity of 
5 the associated protein of interest, or has biological activity comparable to a 
known antagonist or inhibitor of the associated protein of interest. Thus, 
the term "TNF-antagonist peptide" comprises peptides that can be 
identified or derived as described in Takasaki etal. (1997), Nature Biotech . 
15: 1266-70 or any of the references in Table 2 identified as having TNF- 

1 0 antagonistic subject matter. Those of ordinary skill in the art appreciate 
that each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 
different peptide libraries. 

The terms "IL-1 antagonist" and "IL-lra-mimetic peptide" 

1 5 comprises peptides that inhibit or down-regulate activation of the IL-1 
receptor by IL-1. IL-1 receptor activation results from formation of a 
complex among IL-1, IL-1 receptor, and IL-1 receptor accessory protein. 
IL-1 antagonist or IL-lra-mimetic peptides bind to IL-1, IL-1 receptor, or 
IL-1 receptor accessory protein and obstruct complex formation among 

2 0 any two or three components of the complex. Exemplary IL-1 antagonist 
or IL-lra-mimetic peptides can be identified or derived as described in 
U.S. Pat. Nos. 5,608,035, 5,786,331, 5,880,096, or any of the references in 
Table 2 identified as having IL-lra-mimetic or IL-1 antagonistic subject 
matter. Those of ordinary skill in the art appreciate that each of these 

2 5 references enables one to select different peptides than actually disclosed 
therein by following the disclosed procedures with different peptide 

libraries. " 

The term "VEGF-antagonist peptide" comprises peptides that can 
be identified or derived as described in Fairbrother (1998), Biochem. 37: 
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17754-64, and in any of the references in Table 2 identified as having 
VEGF-antagonistic subject matter. Those of ordinary skill in the art 
appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
5 procedures with different peptide libraries. 

The term "MMP inhibitor peptide" comprises peptides that can be 
identified or derived as described in Koivunen (1999), Nature Biotech. 17: 
768-74 and in any of the references in Table 2 identified as having MMP 
inhibitory subject matter. Those of ordinary skill in the art appreciate that 

1 0 each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 
different peptide libraries. 

Additionally, physiologically acceptable salts of the compounds of 
this invention are also encompassed herein. By "physiologically 

1 5 acceptable salts" is meant any salts that are known or later discovered to 
be pharmaceutically acceptable. Some specific examples are: acetate; 
trifluoroacetate; hydrohalides, such as hydrochloride and hydrobromide; 
sulfate; citrate; tartrate; glycolate; and oxalate. 
Structure of compounds 

2 o In General . In the compositions of matter prepared in accordance 

with this invention, the peptide may be attached to the vehicle through the 
peptide's N-terminus or C-terminus. Thus, the vehicle-peptide molecules 
of this invention may be described by the following formula I: 

I 

25 

wherein: 

F 1 is a vehicle (preferably an Fc domain); 

X 1 and X J are each independently selected from -(V)-T\ -(L 1 ) c -P 1 - 

(L\ -P 2 , -(L'VP'-aVP 2 -^).-^ and ^V^V^ 

V 
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P 1 , P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; _ 

V, V, V, and L 4 are each independently linkers; and 

a, b, c, d, e, and f are each independently 0 or 1, provided that at 

5 least one of a and b is 1. 

Thus, compound I comprises preferred compounds of the formulae 

n 

X 1 -F 

and mul timers thereof wherein F 1 is an Fc domain and is attached at the C- 
1 0 terminus of X'; 

m 

r-x 2 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 
terminus of X 2 ; 
15 TV 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 

terminus of -Qjy-V 1 ; and 

V 

20 FMlA-P'-fl-VP 2 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 

terminus of -L'-P'-L'-P 2 . 

Peptides . Any number of peptides may be used in conjunction with 
the present invention. Of particular interest are peptides that mimic the 
2 5 activity of EPO, TPO, growth hormone, G-CSF, GM-CSF, IL-lra, leptin, 
CTLA4, TRAIL, TGF-a, and TGF-p. Peptide antagonists are also of 
interest particularly those antagonistic to the activity of TNF, leptin, any 
of the interleukins QL-1, 2, 3, ...), and proteins involved in complement 
activation (e.g., C3b). Targeting peptides are also of interest, including 
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tumor-homing peptides, membrane-transporting peptides, and the like. 
All of these classes of peptides may be discovered by methods described in 
the references cited in this specification and other references. 

Phage display, in particular, is useful in generating peptides for use 
5 in the present invention. It has been stated that affinity selection from 
libraries of random peptides can be used to identify peptide ligands for 
any site of any gene product. Dedman et al . (1993), T. Biol Chem. 268: 
23025-30. Phage display is particularly well suited for identifying peptides 
that bind to such proteins of interest as cell surface receptors or any 

1 0 proteins having linear epitopes. Wilson etal (1998), Can. T. Microbiol. 44: 
313-29; Kay etal. (1998), Drug Disc. Today 3: 370-8. Such proteins are 
extensively reviewed in Herz etal. (1997), T. Receptor & Signal 
Transduction Res . 17(5): 671-776, which is hereby incorporated by 
reference. Such proteins of interest are preferred for use in this invention. 

15 A particularly preferred group of peptides are those that bind to 

cytokine receptors. Cytokines have recently been classified according to 
their receptor code. See Inglot (1997), Archivum Im munologiae et 
Therapiae Experimentalis 45: 353-7, which is hereby incorporated by 
reference. Among these receptors, most preferred are the CKRs (family I in 

2 0 Table 3). The receptor classification appears in Table 3. 
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Table 3— Cytokine Receptors Classified by Receptor Code 



- Cytokines (ligands) 


Receptor Type 


family subfamily 


family subfamily 


1. Hematopoietic 1. IL-2.IL-4.IL-7, 
cytokines IL-9, IL-13. IL- 

15 

2. IL-3. IL-5, GM- 
CSF 

n ii ft |i i 1 II . 
O. IL-O, IL-1 1, IL- 

12, LIF, OSM, 

HMTF lAntin 

(OB) 
4. G-CSF.EPO. 
TPO. PRL, GH 

«i II -17 HVS-IL- 

3. I U l*| 11 V O IL. 

17 


I: oytoKine n i . snareu yor 
(CKR) 

2. shared GP 140 
PR 

3 3 shared RP 
130 

4. "single chain" R 

5. other R e 


II. IL-10 ligands IL-10, BCRF-1, 

HSV-IL-10 


II. IL-10 R 


111. Interferons 1. IFN-al, a2, a4 t 

m, t, IFN-p" 
2. IFN-y 


III. Interferon R 1. IFNAR 

2. IFNGR 


IV. IL-1 ligands 1. IL-1 a, IL-ip, IL- 

1Ra 


IV. IL-1R 


V. TNF ligands 1. TNF-a, TNF-p 

(LT),FAS1, 
CD40 L, 
CD30L, CD27 L 


V. NGF/TNF R* 


VI. Chemokines 1. a chemokines: 

IL-8, GRO 
a,p,Y, IF-10, 
PF-4, SDF-1 

2. p chemokines: 
MIP1a, MIPip, 
MCP-1 ,2,3,4, 
RANTES, 
eotaxin 

3. r chemokines: 
lymphotactin 


VI. ChemokineR 1. CXCR 

2. CCR 

3. CR 

4. DARC 



c IL-17R belongs to the CKR family but is not assigned to any of the 4 indicated subjamilies. 
d Other I FN type I subtypes remain unasslgned. Hematopoietic cytokines, IL-1 0 ligands and 
interferons do not possess functional intrinsic protein kinases. The signaling molecules for the 
cytokines are JAK's, STATs and related non-receptor molecules. IL-1 4, IL-1 6 and IL-1 8 have been 
cloned but according to the receptor code they remain unassigned. 
• TNF receptors use multiple, distinct intracellular molecules for signal transduction including 
"death domain" of FAS R and 55 kDa TNF-aR that participates in their cytotoxic effects. NGF/TNF 
R can bind both NGF and related factors as well as TNF ligands. Chemokine receptors are G 
protein-coupled, seven transmembrane (7TM, serpentine) domain receptors. 
r The Duffy blood group antigen (DARC) is an erythrocyte receptor that can bind several different 
chemokines. It belongs to the immunoglobulin superfamily but characteristics of its signal 
transduction events remain unclear. 



WO 00/24782 



PCT/US99/25044 



VII. Growth factors 

1.1 SCF.M-CSF, 
PDGF-AA, AB, 
BB, FLT-3L, 
VEGF, SSV- 
PDGF 

1.2 FGFa, FGFB 
1.3EGF.TGF-0, 

W-F19(EGF- 
like) 

1.4IGF-I, IGF-II, 
Insulin 

1.5 NGF, BDNF, 
NT-3, NT-4" 
2. TGF-P1,P2,B3 



VII. RKF 1. TK sub-family 

1.1 IgTKIIIR 



1.2 IgTKIVR 

1.3 Cysteine-rich 
TK-I 

1.4 Cysteine rich 
TK-II 

1.5 Cysteine knot 
TK V 

2. STK subfamily" 



Exemplary peptides for this invention appear in Tables 4 through 
20 below. These peptides may be prepared by methods disclosed in the 
5 art. Single letter amino acid abbreviations are used. The X in these 

sequences (and throughout this specification, unless specified otherwise in 
a particular instance) means that any of the 20 naturally occurring amino 
acid residues may be present. Any of these peptides may be linked in 
tandem (i.e., sequentially), with or without linkers, and a few tandem- 

1 0 linked examples are provided in the table. Linkers are listed as "A" and 
may be any of the linkers described herein. Tandem repeats and linkers 
are shown separated by dashes for clarity. Any peptide containing a 
cysteinyl residue may be cross-linked with another Cys-containing 
peptide, either or both of which may be linked to a vehicle. A few cross- 

1 5 linked examples are provided in the table. Any peptide having more than 
one Cys residue may form an intrapeptide disulfide bond, as well; see, for 
example, EPOmimetic peptides in Table 5. A few examples of 
intrapeptide disulfide-bonded peptides are specified in the table. Any of 
these peptides may be derivatized as described herein, and a few 

2 0 derivatized examples are provided in the table. Derivatized peptides in 



9 The neurotrophic cytokines can associate with NGF/TNF receptors also. 
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the tables are exemplary rather than limiting, as the associated 
underivatized peptides may be employed in this invention, as well. For 
derivatives in which the carboxyl terminus may be capped with an amino 
group, the capping amino group is shown as -NH^. For derivatives in 
5 which amino acid residues are substituted by moieties other than amino 
acid residues, the substitutions are denoted by a, which signifies any of 
the moieties described in Bhatnagar etal. (1996), T. Med. Chem . 39: 3814-9 
and Cuthbertson etal. (1997), T. Med. Chem . 40: 2876-82, which are 
incorporated by reference. The J substituent and the Z substituents (Zy Z^ 

10 . . .ZJ are as defined in U.S. Pat. Nos. 5,608,035 ,5,786,331, and 5,880,096, 
which are incorporated by reference. For the EPO-mimetic sequences 
(Table 5), the substituents ^ through X n and the integer "n" are as defined 
in WO 96/40772, which is incorporated by reference. The substituents 
"0," and "+" are as defined in Sparks etal. (1996), Proc. Natl. Acad. Sci . 93: 

1 5 1540-4, which is hereby incorporated by reference. X 4 , X^ and X, are as 
defined in U.S. Pat. No. 5,773,569, which is hereby incorporated by 
reference, except that: for integrin-binding peptides, X 1/ X 2 , X y X«, X^ X<, X 7 , 
and X 8 are as defined in International applications WO 95/14714, 
published June 1, 1995 and WO 97/08203, published March 6, 1997, which 

20 are also incorporated by reference; and for VIP-mimetic peptides, X,, X/, 
X/*, X 2 , X y X„ X 5 , X 6 and Z and the integers m and n are as defined in WO 
97/40070, published October 30, 1997, which is also incorporated by 
reference. Xaa and Yaa below are as defined in WO 98/09985, published 
March 12, 1998, which is incorporated by reference. AA„ AAj, AB„ AB 2 , 

2 5 and AC are as defined in International application WO 98/53842, 

published December 3, 1998, which is incorporated by reference. X 1 , X 2 , X 3 , 
and X 4 in Table 17 only are as defined in European application EP 0 911 

h STKS may encompass many other TGF-p-related factors that remain unassigned. The protein 
kinases are intrinsic part of the intracellular domain of receptor kinase family (RKF). The enzymes 
participate in the signals transmission via the receptors. 
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393, published April 28, 1999. Residues appearing in boldface are D- 
amino acids. All peptides are linked through peptide bonds unless 
otherwise noted. Abbreviations are listed at the end of this specification. In 
the "SEQ ID NO." column, "NR" means that no sequence listing is required 
5 for the given sequence. 



Table 4 — IL-1 antagonist peptide sequences 



JCUUullC/ a 11 Ul IUXC 


SEO 

IU IN us 


Z^.QZ.YZZZ.. 


212 


XXQZ^YZpXX 


907 


Z 7 XQZ I5 YZ ft XX 




Z^.QZ.YZ.ZZ,,, 


Ana 

909 


Z„Z^„QZ.YZZZ n 


910 


Z, ;> Z„Z 1 .Z„Z 1s Z, 7 Z, a Z,^ n ZMZ^„ZZ,QZ s YZ (! Z 0 Z in L 


917 


"7 A. | *j —9 —9 —j 


979 


TANVSSFEWTPYYWQPYALPL 


213 


SWTDYGYWQPYALPISGL 


214 


ETPFTWEESNAYYWQPYALPL 


215 


ENTYSPNWADSMYWQPYALPL 


216 


SVGEDHNFVVTSEYWQPYALPL 


21/ 




218 


FEWTPGYWQPY 


219 


FEWTPGYWQHY 


220 


FEWTPGWYQJY 


221 


AcFEWTPGWYQJY 


222 


FEWTPGWpYQJY 


223 


FAWTPGYWQJY 


224 


FEWAPGYWQJY 


225 


FEWVPGYWQJY 


226 


FEWTPGYWQJY 


227 


AcFEWTPGYWQJY 


228 


FEWTPaWYQJY 


229 


FEWTPSarWYQJY 


230 


FEWTPGYYQPY 


231 


FEWTPGWWQPY 


232 


FEWTPNYWQPY 


233 


FEWTPvYWQJY 


234 


FEWTPecGYWQJY 


235 


FEWTPAibYWQJY 


236 


FEWTSarGYWQJY 


237 


FEWTPGYWQPY 


238 


FEWTPGYWQHY 


239 


FEWTPGWYQJY 


240 
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AcFEWTPGWYQJY 


241 


FEWTPGW-pY-QJY 


242 


FAWTPGYWQJY 


_ 243_ 


FEWAPGYWQJY 


244 


FEWVPGYWQJY 


245 


FEWTPGYWQJY 


246 


AcFEWTPGYWQJY 


247 


FEWTPAWYQJY 


248 


FEWTPSarWYQJY 


249 


FEWTPGYYQPY 


250 


FEWTPGWWQPY 


251 


FEWTPNYWQPY 


252 


FEWTPVYWQJY 


253 


FEWTPecGYWQJY 


254 


FEWTPAibYWQJY 


255 


FEWTSarGYWQJY 


256 


FEWTPGYWQPYALPL 


257 


1 NapEWTPGYYQJY 


258 


YEWTPGYYQJY 


259 


FEWVPGYYQJY 


260 


FEWTPSYYQJY 


261 


FEWTPNYYQJY 


262 


TKPR 


263 


RKSSK 


264 


RKQDK 


265 


NRKQDK 


266 


RKQDKR 


267 


ENRKQDKRF 


268 


VTKFYF 


269 


VTKFY 


270 


VTDFY 


271 


SHLYWQPYSVQ 


671 


TLVYWQPYSLQT 


672 


RGDYWQPYSVQS 


673 


VHVYWQPYSVQT 


674 


RLVYWQPYSVQT 


675 


SRVWFQPYSLQS 


676 


NMVYWQPYSIQT 


677 


SWFWQPYSVQT 


678 


TFVYWQPYALPL 


679 


TLVYWQPYSIQR 


680 


RLVYWQPYSVQR 


681 


SPVFWQPYSIQI 


682 


WIEWWQPYSVQS 


683 


SLIYWQPYSLQM 


684 


TRLYWQPYSVQR 


~ 685 


RCDYWQPYSVQT 


686 


MRVFWQPYSVQN 


687 


KIVYWQPYSVQT 


688 


RHLYWQPYSVQR 


689 



lb 
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ALVWWQPYSEQI 


Aon 


SRVWFQPYSLQS 


ovi 


WEQPYALPLE 




QLVWWQPYSVQR 




DLRYWQPYSVQV 




ELVWWQPYSLQL 


D7D 


DLVWWQPYSVQW 


WO 


NGNYWQPYSFQV 




ELVYWQPYSIQR 


Wo 


ELMYWQPYSVQE 




NLLYWQPYSMQD 


/uu 


GYEWYQPYSVQR 


/Ui 


SRVWYQPYSVQR 


/uz 


LSEQYQPYSVQR 


/IA> 


GGGWWQPYSVQR 




VGRWYQPYSVQR 


/ID 


VHVYWQPYSVQR 


/Uo 


QARWYQPYSVGR 


/U/ 


VHVYWQPYSVQT 


7Uo 


RSVYWQPYSVQR 


709 


TRVWFQPYSVQR 


710 


GRIWFQPYSVQR 


71 i 


GRVWFQPYSVQR 


712 


ARTWYQPYSVQR 


713 


ARVWWQPYSVQM 


714 


RLMFYQPYSVQR 


715 


ESMWYQPYSVQR 


716 


HFGWWQPYSVHM 


717 


ARFWWQPYSVQR 


718 


RLVYWQ PYAPIY 


719 


RLVYWQ PYSYQT 


720 


RLVYWQ PYSLPI 


721 


RLVYWQ PYSVQA 


722 


SRVWYQ PYAKGL 


723 


SRVWYQ PYAQGL 


724 


SRVWYQ PYAMPL 


723 


SRVWYQ PYSVQA 


/2o 


SRVWYQ PYSLGL 


7Z/ 


SRVWYQ PYAREL 


7ZO 


SRVWYQ PYSRQP 


/Z7 


CD\AA/VH PVPVOP 


730 


EYEWYQ PYALPL 


731 


IPEYWQ PYALPL 


732 


SRIWWQ PYALPL 


733 


DPLFWQ PYALPL 


734 


SRQWVQ PYALPL 


735 


IRSWWQ PYALPL 


736 


RGYWQ PYALPL 


737 


RLLWVQ PYALPL 


738 


EYRWFQ PYALPL 


739 



3| 
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DAYWVQ PYALPL 


740 


WSGYFQ PYALPL 


741 


NIEFWQ PYALPL 


742 


TRDWVQ PYALPL 


743 


DSSWYQ PYALPL 


744 


IGNWYQ PYALPL 


745 


NLRWDQ PYALPL 


746 


LPEFWQ PYALPL 


747 


DSYWWQ PYALPL 


748 


RSQYYQ PYALPL 


749 


ARFWLQ PYALPL 


750 


NSYFWQ PYALPL 


751 


RFMYWQPYSVQR 


752 


AHLFWQPYSVQR 


753 


WWQPYALPL 


754 


YYQPYALPL 


755 


YFQPYALGL 


756 


YWYQPYALPL 


757 


RWWQPYATPL 


758 


GWYQPYALGF 


759 


YWYQPYALGL 


760 


IWYQPYAMPL 


761 


SNMQPYQRLS 


762 


TFVYWQPY AVGLPAAETACN 


763 


TFVYWQPY SVQMTITGKVTM 


764 


TFVYWQPY SSHXXVPXGFPL 


765 


TFVYWQPY YGNPQWAIHVRH 


766 


TFVYWQPY VLLELPEGAVRA 


767 


TFVYWQPY VDYVWPIPIAQV 


768 


GWYQPYVDGWR 


769 


RWEQPYVKDGWS 


770 


EWYQPYALGWAR 


771 


GWWQPYARGL 


772 


LFEQPYAKALGL 


773 


GWEQPYARGLAG 


774 


AWVQPYATPLDE 


775 


MWYQPYSSQPAE 


776 


GWTQPYSQQGEV 


777 


DWFQPYSIQSDE 


778 


PWIQPYARGFG 


779 


RPLYWQPYSVQV 


780 


TUYWQPYSVQI 


781 


RFDYWQPYSDQT 


782 


WHQFVQPYALPL 


783 


EWDS VYWQPYSVQ TLLR 


784 


WEQN VYWQPYSVQ SFAD 


785 


SDV VYWQPYSVQ SLEM 


786 


YYDG VYWQPYSVQ VMPA 


787 


SDIWYQ PYALPL 


788 


QRIWWQ PYALPL 


789 
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QRIWWD PYAI PI 


790 


OQI YWO PYAI PI 


791 


TIIWFO PYAI PI 


792 


WPTWYO PYAI PL 


793 


c:Yn\A/PD PYAI PI 


794 


QRIWPH PYAI PL 

OrilVVVylV r TnLr L 


795 


FIMFWO PYALPL 

CMVIr WW r inLrL 


796 


nvvwnn pyalpi 

U i V VVVjIVx i T f\Lr U 


797 


MHI 1 \/n WYHPYAI PI 

IV1L/L,L_ V V*j/ VV I Ur TnLrL 


798 


i^QkV/ll WYHPYAI PI 
VaOIWIL VYTwr TnLrL 


799 


QrYftAMI WYHPYAI PI 
nVJVjMINI VVTVJr TnLr L 


800 


i^i^r^npp wyhpyai pi 

UuvJULr VVTVJr TnLrL 


801 


QHI PRT WYHPYAI PI 
OvJLCrf 1 VVTUr TnLrL 


802 


CTWV/DP WVOPYAI PI 
1 1 WVnC WTUr TnLrL 


803 


KVi^QTO WVOPYAI PI 
ftrMjO 1 vW W T wr T MLr L 


804 


1 rtADMM \A/Vr>.PVAI PI 
LUnPllvllN VVTVJr TMLrL 


805 


[TDDOrW WVi^PVAl PI 


806 


X/lrrWlA/Q \A/VOPVAI PI 
VrxUrxVVn VVTvJrYALrL 


807 ! 


1 DDUHV \A/VOPVAI PI 
LnhnUV VVTVJr TnLrL 


808 


DCTAOI lAIVr^DVAl Dl 

Ho 1 Aol VVTUrTMLrL 


809 


CPI/CHA \A/Vr^DVAI Dl 

coKcUU VVYUPYALPL 


810 


EGLTMK WYQPYALPL 


fin 


EGSRcCi VVYUPYALPL 




VIEWWQ PYALPL 


813 


\/Uf\/IA/m DV/AI Dl 

VWYWEQ PYALPL 


814 


ASEWWQ PYALPL 




t-v/c i a n * ir\ DVAI Dl 

FtEWWQ PYALPL m _., 


816 

Oil/ 


tGWVVVU PYALPL 


817 


1 > !/«■% r"| A/1 /"\ DV/AI Dl 

WGEWLQ PYALPL 


818 


n\/\AA/cn DN/AI Dl 

DYVWcU PYALPL 


819 


AI_IT1AnA#r> DVAI Dl 

AnTWVVvJ PYALPL 


820 


rlbWrU PYALrL 


821 


IA/I AlA/C/"\ DVAI Dl 

WLAWtU PYALrL 


822 


\/hAC\At\Air\ DVAI Di 
VMfcWWvJ rYALrL 


823 


CDK>l\A/0 DVAI PI 
tHMWU r TnLrL 


824 


MYYWYY PYAI PI 
INAAVVAA r TnLrL 


825 


VAVf^MWYO PYAI PI 
VVUiliVVTU r TnLrL 


826 


Tl VAAiprj PYAI PI 


827 


V/WRWPO PYAI PI 


828 


1 1 WTO PVAI PI 
LLW lUr T nLr L 


829 


SRIWXX PYALPL 


830 


SDIWYQ PYALPL 


831 


WGYYXX PYALPL 


832 


TSGWYQ PYALPL 


833 


VHPYXX PYALPL 


834 


EHSYFQ PYALPL 


~ 835 


XXIWYQ PYALPL 


836 


AQLHSQ PYALPL 


837 


WANWFQ PYALPL 


838 


SRLYSQ PYALPL 


839 
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1 GVTFSQ PYALPL 


840 


SIVWSQ PYALPL 


841 


SRDLVQ PYALPL - - - 


842 


HWGH VYWQPYSVQ DDLG 


843 


SWHS VYWQPYSVQ SVPE 


844 


WRDS VYWQPYSVQ PESA 


845 


TWDA VYWQPYSVQ KWLD 


846 


TPPW VYWQPYSVQ SLDP 


847 


YWSS VYWQPYSVQ SVHS 


848 


YWY QPY ALGL 


849 


YWY QPY ALPL 


850 


EWI QPY ATGL 


851 


NWE QPY AKPL 


852 


AFY QPY ALPL 


853 


FLY QPY ALPL 


854 


VCK QPY LEWC 


855 


ETPFTWEESNAYYWQPYALPL 


856 


QG WLTWQDSVDM YWQPYALP L 


857 


FSEAGYTWPENTYWQPYALPL 


858 


TESPGGLDWAKIYWQPYALPL 


859 


DGYDRWRQSGERYWQPYALPL 


860 


TANVSSFEWTPGYWQPYALPL 


861 


SVGEDHNFWTSE YWQPYALPL 


862 


MNDQTSEVSTFP YWQPYALPL 


863 


SWSEAFEQPRNL YWQPYALPL 


864 


QYAEPSALNDWG YWQPYALPL 


865 


NGDWATADWSNY YWQPYALPL 


866 


THDEHI YWQPYALPL 


867 


MLEKTYTTWTPG YWQPYALPL 


868 


WSDPLTRDADL YWQPYALPL 


869 


SDAFTTQDSQAM YWQPYALPL 


870 


GDDAAWRTDSLT YWQPYALPL 


871 


AIIRQLYRWSEM YWQPYALPL 


872 


ENTYSPNWADSM YWQPYALPL 


873 


MNDQTSEVSTFP YWQPYALPL 


874 


SVGEDHNFWTSE YWQPYALPL 


875 


QTPFTWEESNAY YWQPYALPL 


876 


ENPFTWQESNAY YWQPYALPL 


877 


VTPFTWEDSNVF YWQPYALPL 


878 


QIPFTWEQSNAY YWQPYALPL 


879 


QAPLTWQESAAY YWQPYALPL 


ooU 


EPTFTWEESKAT YWQPYALPL 


881 


TTTLTWEESNAY YWQPYALPL 


882 


ESPLTWEESSAL YWQPYALPL 


883 


ETPLTWEESNAY YWQPYALPL 


884 


EATFTWAESNAY YWQPYALPL 


— 885 


EALFTWKESTAY YWQPYALPL 


886 


STP-TWEESNAY YWQPYALPL 


887 


ETPFTWEESNAY YWQPYALPL 


888 


KAPFTWEESQAY YWQPYALPL 


889 
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STSFTWEESNAY YWQPYALPL 


890 


DSTFTWEESNAY YWQPYALPL 


891 


YIPFTWEESNAY YWQPYALPL 


892 


QTAFTWEESNAY YWQPYALPL 


893 


FTI FTWFESNAT YWQPYALPL 


894 


V^FTWPPSNAY YWQPYALPL 

V Owl 1 Vltuulin 1 l fiver i i— 


895 


DPYAI PI 


896 


1 Pv-1-NanPYOJYALPL 


897 


TANX/^FFWTPG YWQPYALPL 


898 


FFWTPfWWOPYALPL 


899 


FFWTPfiYWQJYALPL 


900 


PFWTPftYYQ IYA1 PI 


901 


PTPPTWPPQWAYYWnPYAI PI 


902 


mA/CPQMAYYWD IYAI PI 
F 1 WtCOINM T T VVVJJ T MLrL 


903 


HUVL Y VV\jr in r V 1 LVVV 


904 \ 


An\/AC VAA/r^DVA 1 PI TCI 
(jUVAfc YWUrYM LrLloL 


905 


OlAfTr^N/^ VAA/ODV A 1 DIQS2I 

OW 1 UYwi YWUrYA LrlovaL 


906 I 

7 W 


rcWTrCaYWUrYALrL 


911 

711 


CC\A/TD^\/\A/n IV A 1 Dl 

rcWl rCaYWUJ YALr L 


912 


FcWI rCaWYvJr YALrL 


91 

7 U 


rfcW 1 rvaWYUJ YALrL 


914 

✓ ATX 


rcVVTrCaYYUrYALrL , 


915 


crctArro^* w/^ ivai Dl 
rcWTPCaYYCJJ YALrL 


71U 


TAMV/CeCCU/TDPV\A/ADVAI Dl 

TANVSSrcWl rbYWUrYALrL 


918 

710 


C»\A/Tn\/^\/\A/AD\/A 1 DIG/21 

oWTDYGYWUrYALrlovaL 


919 


CTDCT\AfCCCMAW\A/ADVAI PI 

cTrr I WbfcolMAYYWvJr YALrL j 


920 1 


cKrrvcDM\A/Ancfciiv\A/npVAl Pi 

fclN I YoriMWAUolvl Y WVjr YMLTL 


921 ! 


C\/PCnUMC\A/TCPV\A/nPVAI PI 


922 


n^vnowDnQ^PRVW/nPYAi PI 
UbYUnWnUovaCnYVVur TnLrL 


923 


cciArrD/'SVW/OPVAl PI 

rtW 1 r\aT wur iMLrL 


924 


ppuv/Tprs v\A/rj pv 

rCW 1 ro T VVVJr T 


925 


CP\A/TP^Y\A/n. IY 


926 


PWTPfiYWHPY 

CVV 1 rO T VVUr T — 


927 


revv i rvavv t vju t 


928 


a cwTp^YWd 1 Y 

MCVV 1 rU i VVUJ I 


929 


FA\A/TPrtVW(l IY 


930 




931 


FF\A7 A Pf^YXA/n IY 


932 


rCW 1 AO T VVViU T 


933 


CCIA/TDAVWO. IV 


934 


FEWTPGAWQJY 


935 


FEWTPGYAQJY 


936 


FEWTPGYWQJA 


937 


FEWTGGYWQJY 


938 


FEWTPGYWQJY 


939 


FEWTJGYWQJY 


940 


FEWTPecGYWQJY 


941 


FEWTPAibYWQJY 


942 


FEWTPSarWYQJY 


943 


FEWTSarGYWQJY 


944 
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FEWTPNYWQJY 


945 


FEWTPVYWQJY 


946 


FEWTVPYWQJY - - 


947 


AcFEWTPGWYQJY 


948 S 


AcFEWTPGYWQJY 


949 


INap-EWTPGYYQJY 


950 


YEWTPGYYQJY 


951 


FEWVPGYYQJY 


952 


FEWTPGYYQJY 


953 


FEWTPsYYQJY 


954 


FEWTPnYYQJY 


955 


SHLY-Nap-QPYSVQM 


956 


TLVY-Nap-QPYSLQT 


957 


RGDY-Nap-QPYSVQS 


958 


NMVY-Nap-QPYSIQT 


959 


VYWQPYSVQ 


960 


VY-Nap-QPYSVQ 


961 


TFVYWQJYALPL 


962 


FEWTPGYYQJ-Bpa 


963 


XaaFEWTPGYYQJ-Bpa 


964 


FEWTPGY-Bpa-QJY 


mm 

965 


AcFEWTPGY-Bpa-QJY 


966 


FEWTPG-Bpa-YQJY 


967 


AcFEWTPG-Bpa-YQJY 


968 


AcFE-Bpa-TPGYYQJY 


969 


AcFE-Bpa-TPGYYQJY 


970 


Bpa-EWTPGYYQJY 


971 


AcBpa-EWTPGYYQJY 


972 


VYWQPYSVQ 


973 


RLVYWQPYSVQR 


974 


RLVY-Nap-QPYSVQR 


975 


RLDYWQPYSVQR 


976 


RLVWFQPYSVQR 


977 


RLVYWQPYSIQR 


978 


DNSSWYDSFLL 


980 


DNTAWYESFLA 


981 


DNTAWYENFLL 


jT^ A 

982 


PARE DNTAWYDSFLI WC 


983 


TSEY DNTTWYEKFLA SQ 


984 


SQIP DNTAWYQSFLL HG 


985 


! SPrl DNTAWYtlMrLL 1 Y 


700 


EQIY DNTAWYDHFLL SY 


987 


TPFI DNTAWYENFLL TY 


988 


TYTY DNTAWYERFLM SY 


989 


TMTQ DNTAWYENFLL SY 


990 


Tl DNTAWYANLVQ TYPQ 


"~ 991 


Tl DNTAWYERFLA QYPD 


992 


HI DNTAWYENFLL TYTP 


993 


SQ DNTAWYENFLL SYKA 


994 


Ql DNTAWYERFLL QYNA 


995 
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NO DNTAWYESFLL OYNT 


996 


Tl niMTAWYENFLL NHNL 


997 


HY DNITAWYERFLQ QGWH 


998 


FTPFTWEESNAYYWQPYALPL 


999 


YIPFTWFFSNAYYWQPYALPL 


1000 


n^YDRWROSGERYWOPYALPL 


1001 


nY-l Ma n-n Y-OJ YALP L 


1002 


TA NV^^FF WTPGYWO PYALP L 


1003 


FP\A/TPftYWOJYAI PL 


1004 


PPXA/TPftVWOPYAI PI 55D 


1005 


PPVA/TPftVYniYAI PI 


1006 


PPlA/TPrtVWfl IV 


1007 


Ar»PP\A/TPfi YWD. IY 


1008 




1009 


ApPCWTP^YYO IY 


1010 


A^CCU/TPqVWO IV 
ACr fc W 1 r a Y VVvjiJ t 


1011 


A/^PCVA/TPq\A/VO IV 

ACrfcW 1 raWTUJi 


1012 


A^CC\A/TDoWn IV 

MCrtW I roY TvJJT 


1013 


FEWTPGYYQJYALPL 


1014 


FEWTPGYWQJYALPL 


1015 


FEWTPGWYQJYALPL 


1016 


TANVSSFEWTPGYWQPYALPL 


1017 


AcFEWTPGYWQJY 


1018 


AcFEWTPGWYQJY 


1019 


AcFEWTPGYYQJY 


1020 


AcFEWTPAYWQJY 


1021 


AcFEWTPAWYQJY 


1022 


AcFEWTPAYYQJY 


1023 



\ 



31 
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Table 5 — EPOmimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


YXCXXGPXTWXCXP 


83 


YXCXXGPXTWXCXP-YXCXXGPXTWXCXP 


QA 


YXCXXGPXTWXCXP-A-YXCXXGPXTWXCXP 


85 


YXCXXGPXTWXCXP-A- , . s 

V (e-amine) 


86 


\ 

/ 

YXCXXGPXTWXCXP-A- (a-amme) 


86 


GGTYSCHFGPLTWVCKPQGG 


0/ 


GGDYHCRMGPLTWVCKPLGG 


88 


GGWACRMGPITWVCSPLGG 


89 


VGNYMCHFGPITWVCRPGGG 


90 


GGLYLCRFGPVTWDCGYKGG 


91 


GGTYSCHFGPLTWVCKPQGG- 
GGTYSCHFGPLTWVCKPQGG 


92 


GGTYSCHFGPLTWVCKPQGG -A- 

uu I T ownruri. I v v vunruw 


93 




94 


GGTYSCHFGPLTWVCKPQGGSSK- 
GGTYSCHFGPLTWVCKPQGGSSK 


95 i 


GGTYSCHFGPLTWVCKPQGGSSK-A- 
GGTYSC HFG P LTW VCKPQGGSbK 




GGTYSCHrvar L I WVUJ\rUoboo v , x 

\ (e-amine) 

\ 


Q7 1 

I 


\ 
/ 

.PA 

GGTYSCHFGPLTWVCKPQGGSS (a-amine) 


97 


GGTYSCHFGPLTWVCKPQGGSSK(-A-biotin) 


98 


CXXGPXJWX,C 


421 


GGTYSCHGPLTWVCKPQGG 


422 


VGNYMAHMGPITWVCRPGG 


423 


GGPHHVYACRMGPLTWIC 


• 424 


GGTYSCHFGPLTWVCKPQ 


425 


GGLYACHMGPMTWVCQPLRG 


426 


TIAQYICYMGPETWECRPSPKA 


427 


YSCHFGPLTWVCK 


428 


YCHFGPLTWVC 


429 


XXX.GPXJWXX 


124 


YXXXXGPX.TWXX 


461 
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w ww v V V PDY T\A#V Y Y Y Y 


419 


w v/\/ r*\/ V PDV "TIA/V PY Y Y 

X^YX^CX^GPAftTWX^ApA^A,, 


420 


GGLYLCRrGPVTwDOvaYrsvjiui 


1024 


GGTYSCHFGPLTWVCKrUfciUi 


1025 1 


GGDYHCRMGPLTWVCKPLGvj 


1026 


VfiNYMCHFGPITWVCRPGGG 


1029 


GGVYACRMGPITWVCSPLGG 


1030 


VGNYMAHMGPITWVCRPGG 


1035 


GGTYSCHFGPLTWVCKPQ 


1036 


GGLYACHMGPMTWVCQPLRG 


1037 


TIAQYICYMGPETWECRPSPKA 


1038 


YSCHFGPLTWVCK 


1039 


YCHFGPLTWVC 


1040 


SCHFGPLTWVCK 


1041 


(AX,)XX.X t GPXJWXX 


1042 
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Table 6 — TPO-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


IEGPTLRQWLAARA 


13 


IFfiPTI ROWLAAKA 


24 


IF^PTl RFWLAARA 


25 


IFHPTI PfjWI AARA-A-IFfiPTL ROWLAARA 


26 


IP^DTI POWI A A If A A-IFfnPTI ROWI AAKA 
ItUr 1 LnUVVLnni\n - A ICUi 1 LnUViLnnr\n 


27 


ICODTI Dr^^l A ADA A ICO DTI DfVM AARA 

1 1 
1 1 


28 


IEGPTLRQWLAARA-A-K(BrAc)-A-IEGPTLRQWLAARA 


29 


IEGPTLRQWLAARA-A-K(PEG)-A-IEGPTLRQWLAARA 


30 


IEGPTLRQCLAARA-A-IEGPTLRQWLAARA 


31 


1 

1 EG PTLRQC LA AR A- A-l EG PTLRQ WLAA RA 


31 


IEGPTLRQWLAARA-A-IEGPTLRQCLAARA 

1 

IEGPTLRQWLAARA-A-IEGPTLRQCLAARA 


32 


32 


VRDQIXXXL 


33 


TLREWL 


34 


GRVRDQVAGW 


35 


GRVKDQIAQL 


36 


GVRDQVSWAL 


37 


ESVREQVMKY 


38 


SVRSQISASL 


39 


GVRETVYRHM 


40 


GVREVIVMHML 


41 


GRVRDQIWAAL j 


42 


AGVRDQILIWL j 


4»5 


GRVRDQIMLSL 


44 


GRVRDQ!(X) 5 L 


40 


CTLRQWLQGC 


4o 


CTLQEFLEGC 


4/ 




48 


CTLREWLHGGFC 


49 


CTLREWVFAGLC 


50 


CTLRQWLILLGMC 


51 


CTLAE FLASGVEQC 


52 


CSLQEFLSHGGYVC 


53 


CTLREFLDPTTAVC 


54 


CTLKEWLVSHEVWC 


55 


CTLREWL(X)„C 


56-60 


REGPTLRQWM 


6r 


EGPTLRQWLA 


62 


ERGPFWAKAC 


63 


REGPRCVMWM 


64 


CGTEGPTLSTWLDC 


65 



^0 
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CEQDGPTLLfcWLKG 




CE LVG PSLMSWLTO 


67 


CLTGPrVTQWLYcC 


68 


/>n a /■> o"T"l i I A/I T1 ^ 

CRAGPTLLEWLTLC 


69 


fiADttPTL RPWISFC 


70 


C(X) (J EGPTLREWL(X)..,C 


71-74 


GGCTLREWLHGGFCGG 


75 


GGCADGPTLREWISFCGG 


76 


GNADGPTLRQWLEGRRPKN 


77 


LAIEGPTLRQWLHGNGRDT 


78 


HGRVGPTLREWKTQVATKK 


79 


TIKGPTLRQWLKSREHTS 


80 


ISDGPTLKEWLSVTRGAS 


81 


SIEGPTLREWLTSRTPHS 


82 
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Table 7— G-CSF-mimetic peptide sequences 



Sequence/structure 




ID NO: 


EEDCK 


99 


EEDCK 
1 

EEDCK 


99 


99 


EEDoK 


10U l 


EEDoK 


100 


1 

EEDoK 


100 


DGIuEDoK 


101 


pGluEDoK 

1 


101 


pGluEDoK 


101 


PicSDoK 


102 


PicSDoK 
1 

PicSDoK 


102 


102 


EEDCK-A-EEDCK 


103 


EEDXK-A-EEDXK 


104 



1* 
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Table 8— TNF-antagonist peptide sequences 



Sequence/structure 

• 


SEQ 
ID NO: 


YCFTASENHCY 


106 


YCFTNSENHCY 


107 


YCFTRSENHCY 


108 


FCASENHCY 


109 


YCASENHCY 


110 


FCNSENHCY 


111 


FCNSENRCY 


112 


FCNSVENRCY 


113 


YCSQSVSNDCF 


114 


FCVSNDRCY 


IID 


YCRKELGQVCY 


116 


YCKEPGQCY 


117 


T vni\civiwiv i j 


118 


FCRKEMGCY 


119 


YCWSQNLCY 


120 


YCELSQYLCY 


121 


YCWSQNYCY 


122 


YCWSQYLCY 


123 


DFLPHYKNTSLGHRP 


1085 


AA,-AB, 

\ 

AC 

/ 

AA.-AB, 


NR 





HI 
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Table 9— Integrin-binding peptide sequences 



deQucncc/siruciuxc 


SEO 


RX,ETX,WX, 


jI ill 

441 


RX,ETX,WX, 


442 


RGDGX 


443 


CRGDGXC 


ji ^ jt 

yi n A 
Tn 


CX,X,RLDXXC 


A A P 

445 


CARRLDAPC 


A A S 

446 


CPSRLDSPC 


447 


X.XXRGDXXX. 


448 


CX,CRGDCX.C 


449 


CDCRGDCFC 


450 


CDCRGDCLC 


451 


CLCRGDCIC 


452 


XXDDXXXX 


453 


X.XXDDXXXX,X„ 


At mm 4 

454 


CWDDGWLC 


455 


CWDDLWWLC 


456 


CWDDGLMC 


* m mm 

457 


CWDDGWMC 


458 


CSWDDGWLC 


459 


CPDDLWWLC 


460 


NGR 


NR 


GSL 


NR 


RGD 


NR 


CGRECPRLCQSSC 


1071 


CNGRCVSGCAGRC 


1072 


CLSGSLSC 


1073 


RGD 


NR 


NGR 


NR 


GSL 


NR 


NGRAHA 


1074 


CNGRC 


1075 


CDCRGDCFC 


4 AT/ 

1076 


CGSLVRC 


1077 


DLXXL 


1043 


RTDLDSLRTYTL 


1044 


RTDLDSLRTY 


1053 


RTDLDSLRT 


1054 


RTDLDSLR 


1078 


GDLDLLKLRLTL 


1079 


GDLHSLRQLLSR 


1080 


RDDLHMLRLQLW 


1081 


SSDLHALKKRYG 


1082 


RGDLKQLSELTW 


1083 


RGDLAALSAPPV 


1084 



11 
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Table 10— Selectin antagonist peptide sequences 



Sequence/structure 


SEQ 


ID NO: 


DITWDOLWDLMK 


147 


DiTWDELWKIMN 


148 


DYTWFELWDMMQ 


149 


OITWAQLWNMMK 


150 


DMTWHDLWTLMS 


151 


DYSWHDLWEMMS 


152 


FITWDOLWEVMN 


153 


HV^WFOLWDIMN 


154 


WITWHOI WRIMT 


155 


RMM^Wi Ft WFHMK 


156 


MEW 1 VVL/\j<LVV n V IVIIMi rAL-sJvic 


157 


nnMCVVLMUVVCWlVIOr 


158 




159 


iTwnni u/ni mk 

1 1 VV LJVj/LVV LJL,rvir\ 


160 


Ul 1 VVUULVV L/LJVJr\ 


161 




162 


DITWDOLWDLMK 


163 


CQNRYTDLVAIQNKNE 


462 


AENWADNEPNNKRNNED 


463 


RKNNKTWTWVGTKKALTNE 


464 


KKALTNEAENWAD 


465 


CQXRYTDLVAIQNKXE 


466 


RKXNXXWTWVGTXKXLTEE 


467 


AENWADGEPNNKXNXED 


468 


CXXXYTXLVAIQNKXE 


469 


RKXXXXWXWVGTXKXLTXE 


470 


AXNWXXXEPNNXXXED 


471 


XKXKTXEAXNWXX 


472 



fx 
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Table 11— Antipathogenic peptide sequences 



sequence/structure 


SEO 

ti-\ iLjtr\, 
ID IN (J: 


GFFALIPKIISSPLFKTLLSAVGSALSSSGGQQ 


503 


GFFALIPKIISSPLFKTLLSAVGSALSSSGGQE 


504 


GFFALIPKIISSPLFKTLLSAV 


PAP 

505 


GFFALIPKIISSPLFKTLLSAV 


506 i 


KGFFALIPKIISSPLFKTLLSAV 


507 


KKGFFAUPKIISSPLFKTLLSAV 


508 


KKGFFALIPKIISSPLFKTLLSAV 


509 


GFFALIPKIIS 


510 


GIGAVLKVLTTGLPALISWIKRKRQQ 


511 


GIGAVLKVLTTGLPALISWIKRKRQQ 


512 


GIGAVLKVLTTGLPALISWIKRKRQQ 


513 


GIGAVLKVLTTGLPALISWIKR 


514 


AVLKVLTTGLPAUSWIKR 


515 


KLLLLLKLLLLK 


516 


KLLLKLLLKLLK 


517 


KLLLKLKLKLLK 


518 


KKLLKLKLKLKK 


519 


KLLLKLLLKLLK 


520 


KLLLKLKLKLLK 


521 


KLLLLK 


522 


KLLLKLLK 


523 


KLLLKLKLKLLK 


524 


KLLLKLKLKLLK 


525 


KLLLKLKLKLLK 


526 


KAAAKAAAKAAK 


527 


KVWKWVKVVK 


528 


KVWKVKVKWK 


529 


KVWKVKVKVK 


530 


KVWKVKVKWK 


531 


KLILKL 


AM f% 

532 


KVLHLL 


4% rffe 

533 


LKLRLL 


534 


KPLHLL 


535 


KLILKLVR 


536 


| *\ IPI II 1 III 

KVFHLLHL 


517 


HKFRILKL 


538 


KPFHILHL 


539 


KIIIKIKIKIIK 


540 


KIIIKIKIKIIK 


541 


KIIIKIKIKIIK 


542 


KIPIKIKIKIPK 


543 


KIPIKIKIKIVK 


544 - 


RIIIRIRIRIIR 


545 


RIIIRIRIRIIR 


546 


RIIIRIRIRIIR 


547 


RIVIRIRIRUR 


548 



% 
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RIIWRIRI RIIR 


549 


RlftlRI RVRIIR 


550 


kWIRIRIRI IR 


551 


RIAVKWRLRF1K 


552 


klftYA/kl RVRIIR 


553 


kklftWI IIRVRR 


554 


PIVIRIRIRI IRIR 


555 


PIIX/PIPI RIIRVR 


556 


□ l/^IDI D\/DIIDP\/ 


557 




558 




559 


l/irilk'APVDIIRY/k'H 


560 


DIIX/UIRI RliUILIIRI 

nil VnlnLnllnniriL. 


561 


u\n\w Ai-i\/DilQ\/i-lll 


562 


n 1 Y V i\l n L n Y I r\i\i n u 


563 


iNlvali l\M n V n 1 1 n T i\i i 


564 


Ml Y Vi\rrlrrtYiiNr\inL , , , 


565 


tsor* LI If A D DU 1 1 D Vl^l 1 


566 


I/IWID1DIDI IDIRIRIIIX/ 

NVInlnlnLlnlnlnlSIV 


567 

w ^x ■ 


nllVKIHLrillr\i\inLir\l\ _ 


568 


KIGWKLnVnllnVftlonLn 


569 


KLVIRInlnLlnlnlnWViwKnln 


570 


DCA\/l/IDI Dllt/l/IDI IkVtDk'RX/il^ 

RrAVKInLnllKKinLlftmnlSnVlft 


571 


iyA/^iA#i/i DX/DIIDX/LflflDI Rk'IfSVA/kVRV/RIK 
KAGWKLH V nil n VMunLn Mui wrsivn v nii\ 


572 


RIYVKrnrnYIKAInL 


573 


KrGHKAnrnllnYKll 


574 


KIVInlnlnLlnlnlnrvlV 


575 


nil VKInLnlll\f\lnLlf\r\ 


576 


DIN/V/CWIGlVII^Lf IDI 


577 


KIVIFTRIRLTSIRIRSIV 


578 


KPIHKARPTIIRYKMI 


579 


cvclicCKGFFALIPKIISSPLFKTLLSAVC 


580 


CKKGFFALIPKIISSPLFKTLLSAVC 


581 


CKKKGFFALIPKIISSPLFKTLLSAVC 


582 


CycIicCRIVIRIRIRLIRIRC 


583 


CyclicCKPGHKARPHIIRYKlIC 


584 


CyclicCRFA VKI RLR 1 IKKI RLI KKI RKR VI KC 


585 


KLLLKLLL KLLKC 


586 


KLLLKLLLKLLK 


587 


KLLLKLKLKLLKC 


588 


KLLLKLLLKLLK 


589 



HI 
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Table 12— VIP-mimetic peptide sequences 



Sequence/structure 


ID NO: 


HSDAVFYDNYTR LRKQMAVKKYLN SILN 


mm ^\ 

590 


Nle HSDAVFYDNYTR LRKQMAVKKYLN SILN 


mm am 

591 




592 


X,SX,LN 


593 


NH CH CO KKYX5 NH CH CO X6 

1 1 
(CH2)m Z (CH2)n 


594 




KKYL 




NSILN 


596 


KKYL 


597 


KKYA 


598 


AVKKYL 


599 


NSILN 


600 


KKYV 


601 


SILauN 


602 


KKYLNIe 


603 


NSYLN 


604 


NSIYN 


605 


KKYLPPNSILN 


606 


LauKKYL 


607 


CapKKYL 


608 


KYL 


NR 


KKYNIe 


609 


VKKYL 


610 


LNSILN 


611 


YLNSILN 


612 


KKYLN 


613 


KKYLNS 


614 ! 


KKYLN SI 


615 


KKYLNSIL 


616 


KKYL 


61/ 


KKYDA 


ill Q 

bio 


AVKKYL 


61V 


NSILN 


oZU 


\SlS\/\f 


621 

mmm ^» 


SILauN 


622 


NSYLN 


623 


NSIYN 


624 


KKYLNIe 


625 


KKYLPPNSILN 


626 


KKYL 


627 


KKYDA 


628- 


AVKKYL 


629 


NSILN 


630 


KKYV 


631 


SILauN 


632 



If 
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LauKKYL 


633 


CaoKKYL 


634 


KYL 


NR 


KYL 


NR 


KKYNie 


635 


VKKYL 


636 


LNSILN 


637 


YLNSILN 


638 


l/lf\J\ Mix* 

KKYLNie 


639 


KKYLN 


640 


\S\S\f\ KIO 

KKYLNb 


641 


KKYLN SI 


642 


KKYLN5IL 


643 


KKKYLD 


644 


cycllCCKKYLO 


645 


CKKYLK 

i ! 1 


646 


1 1 




KKYA 


647 


WWTDTG L W 


648 


wwrnnGLW 


649 


VV VVLs 1 nV3L-W V ¥V I I t 


650 


PW^NDniWl FSG 


651 


nwnnFGi wpgaa 


652 


nVVUUINCILVi VVVL 


653 


QGMWQHYG 1 WMfi 


654 


GGRWHOAGL WVA 


655 


Ki WSFOGIWMGE 


656 


rw^MWGi wi n 


657 


GHWDMTGIWVPC 


658 


HWHTRG LWVY 


659 


QLWDENGAWI 


660 


KWDDRGLWMH 


661 


QAWNERGLWT 


662 


QWDTRGLWVA 


663 


WNVHGIWQE 


664 


SWDTRGLWVE 


665 


DWDTRGLWVA 


666 


SWGROGLWIE 


667 


EWTDNGLWAL 


668 


SWDEKGLWSA 


669 


SWDSSGLWMD 


670 



19 
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Table 13— Mdm/hdm antagonist peptide sequences 



C/i^nan/ia/cttnirfiirP 
DcQUcnC^SiIuClUxc 


SEQ 

m no- 

IL/ iiv/i 


TFSDLW 




QETFSDLWKLLP 


101 


QPTFSDLWKLLP 




QETFSDYWKLLP 




QPTFSDYWKLLP 


1 Ivl 


MPRrMDYWcuUM 


135 


VQNFIDYWTQQF 


136 


TGPAFTHYWATF 


137 


IDRAPTFRDHWFALV 


138 


PRPALVFADYWETLY 


139 


PAFSRFWSDLSAGAH 


140 


PAFSRFWSKLSAGAH 


141 


PXFXDYWXXL 


142 


QETFSDLWKLLP 


143 


QPTFSDLWKLLP 


144 


QETFSDYWKLLP 


145 


QPTFSDYWKLLP 


146 



Table 14— Calmodulin antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


SCVKWGKKEFCGS 


164 


SCWKYWGKECGS 


165 


SCYEWGKLRWCGS 


166 


SCLRWGKWSNCGS 


167 


SCWRWGKYQICGS 


168 


SCVSWGALKLCGS 


169 


SCIRWGQNTFCGS 


170 


SCWQWGNLKICGS 


171 


SCVRWGQLSICGS 


172 


LKKFNARRKLKGAILTTMLAK 


173 


RRWKKNFIAVSAANRFKK 


174 


RKWQKTGHAVRAIGRLSS 


175 


INLKALAALAKKIL 


176 


KIWSILAPLGTTLVKLVA 


177 


LKKLLKLLKKLLKL 


178 


LKWKKLLKLLKKLLKKLL 


179 


AEWPSLTEIKTLSHFSV 


180 


AEWPSPTRVISTTYFGS 


181 


AELAHWPPVKTVLRSFT 


182" 


AEGSWLQLLNLMKQMNN _ 


183 


AEWPSLTEIK . 


184 
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Table 15— Mast cell antagonists/Mast cell protease inhibitor 



peptide sequences 



Sequence/structure 


SEQ 

, ID NO: 


SGSGVLKRPLPILPVTR 


272 


RWLSSRPLPPLPLPPRT 


273 


GSGSYDTLALPSLPLHPMSS 


274 


GSGSYDTRALPSLPLHPMSS 


275 


GSGSSGVTMYPKLPPHWSMA 


276 


GSGSSGVRMYPKLPPHWSMA 


277 


GSGSSSMRMVPTIPGSAKHG 


278 


RNR 


NR 


QT 


NR 


RQK 


NR 


NRQ 


NR 


RQK 


NR 


RNRQKT 


436 


RNRQ 


437 


RNRQK 


438 


NRQKT 


439 


RQKT 


440 
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Table 16— SH3 antagonist peptide sequences 



sequence/structure ■ - - 


SEO 

in MO- 
IL/ iSKJ* 


RPLPPLP 


282 | 


RELPPLP 


283 


SPLPPLP 


284 


GPLPPLP 


285 


RPLPIPP 


286 


RPLPIPP 


287 


RRLPPTP 


288 


RQLPPTP 


289 


RPLPSRP 


290 


RPLPTRP 


291 


SRLPPLP 


292 


RALPSPP 


293 


RRLPRTP 


294 


RPVPPIT 


295 


ILAPPVP 


296 


RPLPMLP 


297 


RPLPILP 


298 


RPLPSLP 


299 


RPLPSLP 


300 


RPLPMIP 


301 | 


RPLPLIP 


302 


RPLPPTP 


303 


RSLPPLP 


304 


RPQPPPP 


305 


RQLPIPP 


306 


XXXRPLPPLPXP 


307 


XXXRPLPPIPXX 


308 


XXXRPLPPLPXX 


onn 
ouy 


□VVDDI DDI DVD 

nAAnrLrrLrAr 


310 


RXXRPLPPLPPP 


311 


PPPYPPPPIPXX 


312 


PPPYPPPPVPXX 


313 


LXXRPLPXVP 


314 


<FXXRPLPXLP 


315 


PPX0XPPP¥P 


316 


+PPWXKPXWL 


317 


RPXW^R+SXP 


318 


PPVPPRPXXTL 


319 


yPTLPW 


320 


+0DXPLPXLP 


321 
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Table 17— Somatostatin or cortistatin mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


X 1 -X 2 -Asn-Phe-Phe-Trp-Lys-Thr-Phe-X J -Ser-X 4 


473 


Asp Arg Met Pro Cvs Arg Asn Phe Phe Trp Lvs Thr Phe Ser Ser Cys Lys 


474 


Met Pro Cys Arq Asn Phe Phe Trp Lys Thr Phe Ser Ser Cvs Lys 


475 


Cvs Arq Asn Phe Phe Trp Lvs Thr Phe Ser Ser Cys Lys 


476 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


477 


Met Pro Cys Arq Asn Phe Phe Trp Lys Thr Phe Ser Ser Cvs 


478 


Cvs Arq Asn Phe Phe Trp Lvs Thr Phe Ser Ser Cys 


479 


Asp Arq Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


480 


Met Pro Cys Lvs Asn Phe Phe Trp Lys Thr Phe Ser Ser Cvs Lys 


481 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys 


482 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


483 


Met Pro Cvs Lvs Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


484 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


485 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


486 


Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cvs Lys 


487 ! 


Cvs Arg Asn Phe Phe Trp Lvs Thr Phe Thr Ser Cys Lys 


488 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


489 


Met Pro Cys Arq Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


490 


Cys Arg Asn Phe Phe Trp Lvs Thr Phe Thr Ser Cys 


491 


Asp Arg Met Pro Cvs Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


492 


Met Pro Cys Lvs Asn Phe Phe Trp Lys Thr Phe Thr Ser Cvs Lys 


493 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


494 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


495 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


496 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


497 
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Table 18— UKR antagonist peptide sequences 



Sequence/structure - - - - - 


>>cy . 


ID NO: 


AEPMPHSLNFSQYLWYT 


196 


AEHTYSSLWDTYSPLAF 


197 


AELDLWMRHYPLSFSNR 


198 


AESSLWTRYAWPSMPSY 


4* ^^^^ 

199 


AEWHPGLSFGSYLWSKT 


200 


AEPALLNWSFFFNPGLH 


201 


AEWSFYNLHLPEPQTIF 


202 


AEPLDLWSLYSLP PLAM 


203 


AEPTLWQLYQFPLRLSG 


204 


AEISFSELMWLRSTPAF 


205 


AELSEADLWTTWFGMGS 




AESSLWRIFSPSALMMS 


207 


AESLPTLTSILWGKESV 


208 


AETLFMDLWHDKHILLT 


209 


AEILNFPLWHEPLWSTE 


210 


AESQTGTLNTLFWNTLR 


211 


AEPVYQYELDSYLRSYY 


430 


AELDLSTFYDIQYLLRT 


431 


AEFFKLGPNGYVYLHSA 


432 


FKLXXXGYVYL 


433 


AESTYHHLSLGYMYTLN 


434 


YHXLXXGYMYT 


435 
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Table 19— Macrophage and/or 



T-cell inhibiting peptide sequences 



Sequence/structure 


ID NO: 


Xaa-Yaa-Ara 


NR 


Arg-Yaa-Xaa 


NR 


Xaa-Arg-Yaa 


NR 


Yaa-Arg-Xaa 


NR 


Ala-Arg 


NR 


Arg-Arg 


NR 


Asn-Arg 


NR 


Asp-Arg 


NR 


Cys-Arg 


NR 


Gln-Arg 


NR 


ftln-Arn 


NR 


Gly-Arq 


NR 


His-ara 


NR 


lle-Arg 


NR 


Leu-Arq 


NR 


Lvs-Ard 


NR 


Met-Arg 


NR 


Phe-Arg 


NR 


Ser-Arg 


NR 


Thr-Arg 


NR 


Trp-Arq 


NR 


Tyr-Arg 


NR 


Val-Arg 


NR 


Ala-Glu-Arg 


NR 


Arq-Glu-Arg 


NR 


Asn-Glu-Arg 


NR 


Asp-Glu-Arq 


NR 


Cys-Glu-Arg 


NR 


Gln-Glu-Arg 


NR 


Glu-Glu-Arp 


NR 


Gly-Glu-Arg 


IN IV 


His-Glu-Arg 


NR 


lle-GIu-Arg 


NR 


Leu-Glu-Arg 


NR 


Lys-Glu-Arg 


NR 


Met-Glu-Ara 


NR 


Phe-Glu-Ara 


NR 


Pro-Glu-Arq 


NR 


Ser-Glu : Am 


- NR 


Thr-Glu-Arg 


NR 


Trp-Glu-Ara 


NR 


Tyr-Glu-Arp 


NR 


Val-Glu-Arg 


NR 
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Arg-Ala 


NR 


Arg-Asp 


NR 


Arg-Cys - 


NR 


Arg-GIn 


NR 


Arg-Glu 


! NR 


Arg-Gly 


NR 


Arg-His 


NR 


Arg-lle 


NR 


Arg-Leu 


NR 


Arg-Lys 


NR 


Arg-Met 


NR 


Arg-Phe 


NR 


Arg-Pro 


NR 


Arg-Ser 


NR 


Arg-Thr 


NR 


Arg-Trp 


NR 




NR 


Arg-Val 


NR 


Arg-Glu-Ala 


NR 


Arg-Glu-Asn 


NR 


Arg-Glu-Asp 


NR 


Arg-Glu-Cys 


NR 


Arg-Glu-GIn 


NR 


Arg-Glu-Glu 


NR 


Arg-Glu-Gly 


NR 


Arg-Glu-His 


NR 


Arg-Glu-lle 


NR 


Arg-Glu-Leu 


NR 


Arg-Glu-Lys 


NR 


Arg-Glu-Met 


NR 


Arg-Glu-Phe 


NR 


Arg-Glu-Pro 


NR 


Arg-Glu-Ser 


NR 


Arg-Glu-Thr 


NR 


Arg-Glu-Trp 


NR 


Arg-Glu-Tyr 


NR 


Arg-Glu-Val 


NR 


Ala-Arg-Glu 


NR 


Arg-Arg-Glu 


NR 


Asn-Arg-Glu 


NR 


Asp-Arg-Glu 


vn? 

INK 


Cys-Arg-Glu 


NR 


Gln-Arg-Glu 


NR 


Glu-Arg-Glu 


NR 


Gly-Arg-Giu 


NR 


His-Arg-Glu 


- NR 


Ile-Arg-Glu 


NR 


Leu-Arg-Glu 


NR 


Lys-Arg-Glu 


NR 


Met-Arg-Glu 


NR 
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PhA-Arn-Glu 


NR 


Pm-Ara-Glu 


NR 


Sar-Arn-Glu 


NR 


Thr-Am-Glu 


NR 


Tm-Arn-Glu 


NR 


Tvr-Ara-Glu 


NR 


X/al-Ara-Glu 


NR 


Ynlt i-Arn- Ala 


NR 


i.Arn.Am 


NR 


f5h i.Arn-Acn 


NR 


/^li i_ A rn- Aon 


NR 


/^li i.Ain-f^i/c 


NR 


fih »- Arn-f^ln 


NR 


/"ili i.. A rn-.fi lv> 


NR 


filn-Arn-Hio 


NR 


filli i-Am-llo 


NR 


Glu-Arg-Leu 


NR 


Glu-Arg-Lys 


NR 


Glu-Arg-Met 


NR 


Glu-Arg-Phe 


NR 


Glu-Arg-Pro 


NR 


Glu-Arg-Ser 


NR 


Glu-Arg-Thr 


NR 


Glu-Arg-Trp 


NR 


Glu-Arg-Tyr 


NR 


Glu-Arg-Val 


NR 
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Table 20 — Additional Exemplary Pharmacologically Active Peptides 



OctJUcIlvc/ all Ut liiic 


SEQ 
ID 

NO: 


Activitv - - 

AAV T » If 


VEPNCDIHVMWEWECFERL 


1027 


VEGF-antagonist 


GERWCFDGPLTWVCGEES 


1084 


VEGF-antagonist 


RGWVEICVADDNGMCVTEAQ 


1085 


VEGF-antagonist 


GWDECDVARMWEWECFAGV 


1086 


VEGF- antagonist 


GERWCFDGP R A WVCG WE 1 


501 


VEGF- antagonist 


EELWCFDGPRAWVCGYVK 


502 


VEGF- antagonist 


RGWVEICAADDYGRCLTEAQ 


1031 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1087 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1088 


VEGF- antagonist 


GGNECDIARMWEWECFERL 


1089 


VEGF- antagonist 


RGWVEICAADDYGRCL 


1090 


VEGF-antagonist 


1 CTTHWGFTLC 


1028 


MMP inhibitor 


CLRSGXGC 


1091 


MMP inhibitor 


CXXHWGFXXC 


1092 


MMP inhibitor 


CXPXC 


1093 


MMP inhibitor 


CRRHWGFEFC 

X^J 1 1| 11 ITT VI fa»l \^ 


1094 


MMP inhibitor 


STTHWGFTLS 


1095 


MMP inhibitor 




1096 


CTLA4-mimetic 


GFVCSGIFAVGVGRC 


125 


CTLA4-mimetic 


APGVRLGCAVLGRYC 


126 


CTLA4-mimetic ! 


LLGRMK 


105 


Antiviral (HBV) 


ICVVQDWGHHRCTAGHMANLTSHASAI 


127 


C3b antagonist 


ICVVQDWGHHRCT 

V If VjC ¥ TVl II II 1 


128 


C3b antagonist 


CVVQDWGHHAC 

Vp* V V \mM*m* V V VI 1 ■! 1* 


129 


C3b antagonist 


STGGFDDVYDWARGVSSALTTTLVATR 


185 


Vincuiin-binding 


STGGFDDVYDWARRVSSALTTTLVATR 

VyJ ■ ^^^fl p w ■ Umr WWW m ■ V ■ ' • mm www mm w w m ■ 


186 


Vinculin-binding 


SRGVNFSEWLYDMSAAMKEASNVFPSRRSR 


187 


Vincuiin-binding 


SSONWDMEAGVEDLTAAMLGLLSTIHSSSR 


188 


Vinculin-binding 


SSPSLYTQFLVNYESAATRIQDLLIASRPSR 

^^VvpV 1 1m 1 1 ^wm% 9 mm* www m mm^mf* " 1 1 1 II Wmw mm mm f w w^m* w mm ^m w 


189 


Vincuiin-binding 


SSTGWVDLLGALQRAADATRTSI PPSLQNSR 

Xb/ \ mr J m ^mmt WW W mmw Mv\rf ■ mm m^ w WW w9 • wmw m m w ■ ■ ■ » ^™ 


190 


Vinculin-binding 


DVYTKKELIECARRVSEK 


191 


Vinculin-binding 


EKGSYYPGSGIAQFHIDYNNVS 


192 


C4BP-binding 


SGIAQFHIDYNNVSSAEGWHVN 


193 


C4BP-binding 


LVTVEKGSYYPGSGIAQFHIDYNNVSSAEGWHVN 


194 


C4BP-blnding 


SGIAQFHIDYNNVS 


195 


C4BP-binding 


LLGRMK 


279 


anti-HBV 


ALLGRMKG 


280 


anti-HBV 


LDPAFR 


281 


anti-HBV 


CXXRGDC 

• 


322 


Inhibition of platelet 
aggregation 


RPLPPLP 


323 


Src antagonist 


PPVPPR 


324 


Src antagonist 


XFXDXWXXLXX 


325 


Anti-cancer 
(particularly for 
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Qui vvl I luu / 


KACRRLFGPVDSEQLSRDCD 


326 


p16-mimetic 


RERWNFDFVTETPLEGDFAW 




fj ID •Mlllliollw 


KRRQTSMTDFYHSKRRLIFS 


328 


p16-mimetic 


TSMTDFYHSKRRLIFSKRKP 


329 


p16-mimetic 


RRLIF 


330 


p16-mimetic 


KRRQTSATDFYHSKRRLIFSRQIKIWFQNRRMKWKK 


331 


p16-mlmetic 


KRRLIFSKRQIKIWFQNRRMKWKK 


332 


p16-mimetic 


Asn Gin Gly Arg His Phe Cys Gly Gly Ala Leu He His Ala 
Ara Phe Val Met Thr Ala Ala Ser Cys Phe Gin 


498 


CAP37 mlmetic/LPS 
binding^ 


Arg His Phe Cys Gly Gly Ala Leu lie His Ala Arg Phe Val 
Met Thr Ala Ala Ser Cys 


499 


CAP37 mimetic/LPS 
binding 


Gly Thr Arg Cys Gin Val Ala Gly Tip Gly Ser Gin Arg Ser 
Gly Gly Arg Leu Ser Arg Phe Pro Arg Phe Val Asn Val 


500 


CAP37 mimetic/LPS 
binding 


WHWRHRIPLQLAAGR 


VS7t 


UaiUUl lyuialc \uU 1 

aloha) mimetic 


LKTPRV 


1098 


32GPI Ab binding 


N 1 U\ 1 rnV ,_. 


1099 


62GPI Ab binding 1 


NTLKTPRVGGC 


1100 


B2GPI Ab binding 


KDKATF 


liUl 


p^orl AD Dinainy 


KDKATFGCHD 


1102 


B2GPI Ab binding 


KDKATFGCHDGC 


1103 


32GPI Ab binding 


TLRVYK 


1104 


02GPI Ab binding 


ATLRVYKGG 


1105 


p2GPl Ab binding 


CATLRVYKGG 


1106 


02GP1 Ab binding 


INLKALAALAKKIL 


1107 


Membrane- 
transporting 


GWT 


NR 


Membrane- 
transporting 


GWTLNSAGYLLG 


1108 


Membrane- 
transporting 


GWTLNSAGYLLGKINLKALAALAKKIL 


1109 


Membrane- 
transporting 



The present invention is also particularly useful with peptides 



having activity in treatment of: 

• cancer, wherein the peptide is a VEGF-mimetic or a VEGF receptor 
antagonist, a HER2 agonist or antagonist, a CD20 antagonist and the 
like; 

• asthma, wherein the protein of interest is a CKR3 antagonist, an IL-5 
receptor antagonist, and the like; 

• thrombosis, wherein the protein of interest is a GPEb antagonist, a 
GPIIIa antagonist, and the like; 
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• autoimmune diseases and other conditions involving immune 
modulation, wherein the protein of interest is an IL-2 receptor 
antagonist, a CD40 agonist or antagonist, a CD40L agonist or 
antagonist, a thymopoietin mimetic and the like. 
5 Vehicles , This invention requires the presence of at least one vehicle 

(F 1 , F 2 ) attached to a peptide through the N-terminus, C-terminus or a 
sidechain of one of the amino acid residues. Multiple vehicles may also be 
used; e.g., Fc's at each terminus or an Fc at a terminus and a PEG group at 
the other terminus or a sidechain. 

10 An Fc domain is the preferred vehicle. The Fc domain may be fused 

to the N or C termini of the peptides or at both the N and C termini. For 
the TPO-mimetic peptides, molecules having the Fc domain fused to the N 
terminus of the peptide portion of the molecule are more bioactive than 
other such fusions, so fusion to the N terminus is preferred. 

15 As noted above, Fc variants are suitable vehicles within the scope of 

this invention. A native Fc may be extensively modified to form an Fc 
variant in accordance with this invention, provided binding to the salvage 
receptor is maintained; see, for example WO 97/34631 and WO 96/32478. 
In such Fc variants, one may remove one or more sites of a native Fc that 

2 0 provide structural features or functional activity not required by the 
fusion molecules of this invention. One may remove these sites by, for 
example, substituting or deleting residues, inserting residues into the site, 
or truncating portions containing the site. The inserted or substituted 
residues may also be altered amino acids, such as peptidomimetics or D- 

2 5 amino acids. Fc variants may be desirable for a number of reasons, several 
of which are described below. Exemplary Fc variants include molecules 
and sequences in which: 

1. Sites involved in disulfide bond formation are removed. Such removal 
may avoid reaction with other cysteine-containing proteins present in 

(ft 
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the host cell used to produce the molecules of the invention. For this 
purpose, the cysteme-containing segment at the N-terminus may be 
truncated or cysteine residues may be deleted or substituted with other 
amino acids (e.g., alanyl, seryl). In particular, one may truncate the N- 
5 terminal 20-amino acid segment of SEQ ID NO: 2 or delete or 

substitute the cysteine residues at positions 7 and 10 of SEQ ID NO: 2. 
Even when cysteine residues are removed, the single chain Fc domains 
can still form a dimeric Fc domain that is held together non-covalently. 

2. A native Fc is modified to make it more compatible with a selected host 

1 o cell. For example, one may remove the PA sequence near the N- 

terminus of a typical native Fc, which may be recognized by a digestive 
enzyme in E.coli such as proline iminopeptidase. One may also add an 
N-terminal methionine residue, especially when the molecule is 
expressed recombinantly in a bacterial cell such as E. coli. The Fc 
1 5 domain of SEQ ID NO: 2 (Figure 4) is one such Fc variant. 

3. A portion of the N-terminus of a native Fc is removed to prevent N- 
terminal heterogeneity when expressed in a selected host cell. For this 
purpose, one may delete any of the first 20 amino acid residues at the 
N-terminus, particularly those at positions 1, 2, 3, 4 and 5. 

20 4. One or more glycosylation sites are removed. Residues that are 

typically glycosylated (e.g., asparagine) may confer cytolytic response. 
Such residues may be deleted or substituted with unglycosylated 

residues (e.g., alanine). 
5. Sites involved in interaction with complement, such as the Clq binding 

2 5 site, are removed. For example, one may delete or substitute the EKK 

sequence of human IgGl. Complement recruitment may not be 
advantageous for the molecules of this invention and so may be 
avoided with such an Fc variant. 

fc/ 
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6. Sites axe removed that affect binding to Fc receptors other than a 
_ salvage receptor. A native Fc may have sites for interaction with 
certain white blood cells that are not required for the fusion molecules 
of the present invention and so may be removed. 
5 7. The ADCC site is removed. ADCC sites are known in the art; see, for 
example, Molec. Immunol . 29 (5): 633-9 (1992) with regard to ADCC 
sites in IgGl. These sites, as well, are not required for the fusion 
molecules of the present invention and so may be removed. 
8. When the native Fc is derived from a non-human antibody, the native 
10 Fc may be humanized. Typically, to humanize a native Fc, one will 

substitute selected residues in the non-human native Fc with residues 
that are normally found in human native Fc. Techniques for antibody 
humanization are well known in the art. 

Preferred Fc variants include the following. In SEQ ID NO: 2 

ft 

1 5 (Figure 4) the leucine at position 15 may be substituted with glutamate; the 
glutamate at position 99, with alanine; and the lysines at positions 101 and 
103, with alanines. In addition, one or more tyrosine residues can be 
replaced by phenyalanine residues. 

An alternative vehicle would be a protein, polypeptide, peptide, 

2 0 antibody, antibody fragment, , or small molecule (e.g., a peptidomimetic 
compound) capable of binding to a salvage receptor. For example, one 
could use as a vehicle a polypeptide as described in U.S. Pat. No. 5,739,277, 
issued April 14, 1998 to Presta etal. Peptides could also be selected by 
phage display for binding to the FcRn salvage receptor. Such salvage 

2 5 receptor-binding compounds are also included within the meaning of 
"vehicle" and are within the scope of this invention. Such vehicles should 
be selected for increased half-life (e.g., by avoiding sequences recognized 
by proteases) and decreased immunogenicity (e.g., by favoring non- 
immunogenic sequences, as discovered in antibody humanization). 
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As noted above, polymer vehicles may also be used for F and F . 
Various means for attaching chemical moieties useful as vehicles are 
currently available, see, e.g., Patent Cooperation Treaty ("PCT") 
International Publication No. WO 96/11953, entitled ''N-Terminally 
5 Chemically Modified Protein Compositions and Methods," herein 

incorporated by reference in its entirety. This PCT publication discloses, 
among other things, the selective attachment of water soluble polymers to 

the N-terminus of proteins. 

A preferred polymer vehicle is polyethylene glycol (PEG). The PEG 

1 0 group may be of any convenient molecular weight and may be linear or 
branched. The average molecular weight of the PEG will preferably range 
from about 2 kiloDalton ("kD") to about 100 kDa, more preferably from 
about 5 kDa to about 50 kDa, most preferably from about 5 kDa to about 
10 kDa. The PEG groups will generally be attached to the compounds of 

15 the invention via acylation or reductive alkylation through a reactive 

group on the PEG moiety (e.g., an aldehyde, amino, thiol, or ester group) 
to a reactive group on the inventive compound (e.g., an aldehyde, amino, 
or ester group). 

A useful strategy for the PEGylation of synthetic peptides consists 
20 of combining, through forming a conjugate linkage in solution, a peptide 
and a PEG moiety, each bearing a special functionality that is mutually 
reactive toward the other. The peptides can be easily prepared with 
conventional solid phase synthesis (see, for example, Figures 5 and 6 and 
the accompanying text herein). The peptides are "preactivated" with an 
2 5 appropriate functional group at a specific site. The precursors are purified 
and fully characterized prior to reacting with the PEG moiety. Ligation of 
the peptide with PEG usually takes place in aqueous phase and can be 
easily monitored by reverse phase analytical HPLC The PEGylated 
peptides can be easily purified by preparative HPLC and characterized by 
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analytical HPLC, amino acid analysis and laser desorption mass 

spectrometry. _ 
Polysaccharide polymers are another type of water soluble polymer 
which may be used for protein modification. Dextrans are polysaccharide 
5 polymers comprised of individual subunits of glucose predominantly 
linked by al-6 linkages. The dextran itself is available in many molecular 
weight ranges, and is readily available in molecular weights from about 1 
kD to about 70 kD. Dextran is a suitable water soluble polymer for use in 
the present invention as a vehicle by itself or in combination with another 

10 vehicle (e.g., Fc). See, for example, WO 96/11953 and WO 96/05309. The 
use of dextran conjugated to therapeutic or diagnostic immunoglobulins 
has been reported; see, for example, European Patent Publication No. 0 
315 456, which is hereby incorporated by reference. Dextran of about 1 kD 
to about 20 kD is preferred when dextran is used as a vehicle in 

1 5 accordance with the present invention. 

Linkers . Any 'linker" group is optional. When present, its chemical 
structure is not critical, since it serves primarily as a spacer. The linker is 
preferably made up of amino acids linked together by peptide bonds. 
Thus, in preferred embodiments, the linker is made up of from 1 to 20 

2 0 amino acids linked by peptide bonds, wherein the amino acids are selected 
from the 20 naturally occurring amino acids. Some of these amino acids 
may be glycosylated, as is well understood by those in the art. In a more 
preferred embodiment, the 1 to 20 amino acids are selected from glycine, 
alanine, proline, asparagine, glutamine, and lysine. Even more preferably, 

25 a linker is made up of a majority of amino acids that are sterically 
unhindered, such as glycine and alanine. Thus, preferred linkers are 
polygiycines (particularly (Gly),, (Gly) 5 ), poly(Gly-Ala), and polyalanines. - 
Other specific examples of linkers are: 

(Gly) 3 Lys(Gly) 4 (SEQ ID NO: 333); 
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(Gly) 3 AsnGlySer(Gly) 2 (SEQ ID NO: 334); 
(Gly) 3 Cys(Gly) 4 (SEQ ID NO: 335); and 
GlyProAsnGlyGly (SEQ ID NO: 336). 
To explain the above nomenclature, for example, (Gly) 3 Lys(Gly) 4 means 
5 Gly-Gly-Gly-Lys-Gly-Gly-Gly-Gly. Combinations of Gly and Ala are also 
preferred. The linkers shown here are exemplary; linkers within the scope 
of this invention may be much longer and may include other residues. 

Non-peptide linkers are also possible. For example, alkyl linkers 
such as -NH-(CH J ),-C(0)-, wherein s = 2-20 could be used. These alkyl 
1 o linkers may further be substituted by any non-sterically hindering group 
such as lower alkyl (e.g., Q-C,) lower acyl, halogen (e.g., CI, Br), CN, NH,, 
phenyl, ete. An exemplary non-peptide linker is a PEG linker, 
VI 



15 




wherein n is such that the linker has a molecular weight of 100 to 5000 kD, 
preferably 100 to 500 kD. The peptide linkers may be altered to form 
derivatives in the same manner as described above. 

Derivatives . The inventors also contemplate derivatizing the 
2 0 peptide and/or vehicle portion of the compounds. Such derivatives may 
improve the solubility, absorption, biological half life, and the like of the 
compounds. The moieties may alternatively eliminate or attenuate any 
undesirable side-effect of the compounds and the like. Exemplary 
derivatives include compounds in which: 
25 1 . The compound or some portion thereof is cyclic. For example, the 

peptide portion may be modified to contain two or more Cys residues 
(e.g., in the linker), which could cydize by disulfide bond formation. 
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For citations to references on preparation of cyclized derivatives, see 
Table 2. ........ - . 

2. The compound is cross-linked or is rendered capable of arcss-linking 
between molecules. For example, the peptide portion may be modified 
to contain one Cys residue and thereby be able to form an 
intermolecular disulfide bond with a like molecule. The compound 
may also be cross-linked through its C-terminus, as in the molecule 
shown below. 

vn 

^-(xVco-K 



F 1 -(X 1 ) b -CON 



10 3. 




4 . One or more peptidyl [-C(0)NR-1 linkages (bonds) is replaced by a 
non-peptidyl linkage. Exemplary non-peptidyl linkages are -CH,- 
carbamate [-C2VOC(0)NR-L phosphonate , -(^-sulfonamide [-CH,- 
^OjNR-], urea [-NHC(0)NH-], -O^-secondary amine, and alkylated 
1 5 peptide [-C(0)NR 6 - wherein R 6 is lower alkyl]. 

5. The N-terminus is derivatized. Typically, the N-terminus may be 
acylated or modified to a substituted amine. Exemplary N-terminal 
derivative groups include -NRR 1 (other than -NHj), -NRCXOR 1 , 
-NRC(0)OR', -NRS(0)jR\ -NHC(0)NHR\ succinimide, or 

2 0 benzyloxycarbony 1-NH- (CBZ-NH-), wherein R and R 1 are each 

independently hydrogen or lower alkyl and wherein the phenyl ring 
may be substituted with 1 to 3 substituents selected from the group 
consisting of C,-C 4 alkyl, C,-C 4 alkoxy, chloro, and bromo. 

6. The free C-terminus is derivatized. Typically, the C-tefminus is 

2 5 esterif ied or amidated. For example, one may use methods described in 
the art to add (NH-CH 2 -CH J -NH 2 ) 2 to compounds of this invention 



WO 00/24782 



PCT/US99/25044 



having any of SEQ ID NOS: 504 to 508 at the C-terminus. Likewise, 
one may use methods described in the art to add -NH, to compounds 
of this invention having any of SEQ ID NOS: 924 to 955, 963 to 972, 
1005 to 1013, or 1018 to 1023 at the C-terminus. Exemplary C-terminal 
5 derivative groups include, for example, -C(0)R 2 wherein R 2 is lower 

alkoxy or -NR'R 4 wherein R 3 and R 4 are independently hydrogen or C,- 
C 8 alkyl (preferably C,-C 4 alkyl). 

7. A disulfide bond is replaced with another, preferably more stable, 
cross-linking moiety (e.g., an alkylene). See, e.g., Bhatnagar etak 

1 o (1996), T. Med. Chem . 39: 3814-9; Alberts etal. (1993) Thirteenth Am. 

Pep. Svmp ., 357-9. 

8. One or more individual amino acid residues is modified. Various 
derivatizing agents are known to react specifically with selected 
sidechains or terminal residues, as described in detail below. 

1 5 Lysinyl residues and amino terminal residues may be reacted with 

succinic or other carboxylic acid anhydrides, which reverse the charge of the 
lysinyl residues. Other suitable reagents for derivatizing alpha-amino- 
containing residues include inudoesters such as methyl picolinimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic 

2 0 acid; O-methylisourea; 2,4 pentanedione; and transarninase-catalyzed reaction 

with glyoxylate. 

Arginyl residues may be modified by reaction with any one or 
combination of several conventional reagents, including phenylglyoxal, 2,3- 
butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of arginyl 
2 5 residues requires that the reaction be performed in alkaline conditions because 
of the high pKa of the guanidine functional group. Furthermore, these reagents 
may react with the groups of lysine as well as the arginine- epsuon-amino - 
group. 
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Specific modification of tyrosyl residues has been studied extensively, 
with particular interest in introducing spectral labels into tyrosyl residues by 
reaction with aromatic diazonium compounds or tetranitromethane. Most 
commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl 
5 tyrosyl species and 3-nitro derivatives, respectively. 

Carboxyl sidechain groups (aspartyl or glutamyl) may be selectively 
modified by reaction with carbodiimides (R-N=C=N-R') such as 1-cyclohexyl- 
3-(2-morpholinyl-(4-ethyl) carbodiimide or l-ethyl-3-(4-azonia-4,4- 
dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues 

1 o may be converted to asparaginyl and glutaminyl residues by reaction with 

ammonium ions. 

Glutaminyl and asparaginyl residues may be deamidated to the 
corresponding glutamyl and aspartyl residues. Alternatively, these residues 
are deamidated under mildly acidic conditions. Either form of these residues 
1 5 falls within the scope of this invention. 

Cysteinyl residues can be replaced by amino acid residues or other 
moieties either to eliminate disulfide bonding or, conversely, to stabilize cross- 
linking. See, e.g., Bhatnagar etal. (1996), L Med. Chem. 39: 3814-9. 

Derivatization with bifunctional agents is useful for cross-linking the 

2 0 peptides or their functional derivatives to a water-insoluble support matrix or 

to other macromolecular vehicles. Commonly used cross-linking agents 
include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, 
homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
2 5 dithiobis(sucdnimidylpropionate), and bifunctional maleimides such as bis-N- 
maleimido-l,8-octane. Derivatizing agents such as methyl-3-[(p- 
azidophenyl)ditWo]propioimidate yield photoactivatable intermediates that are 
capable of forming crosslinks in the presence of light. Alternatively, reactive 
water-insoluble matrices such as cyanogen bromide-activated carbohydrates 
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and the reactive substrates described in U.S. Pat. Nos. 3,969,287; 3,691,016; 
4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein 

immobilization. 

Carbohydrate (oligosaccharide) groups may conveniently be 
5 attached to sites that are known to be glycosylation sites in proteins. 
Generally, O-linked oligosaccharides are attached to serine (Ser) or 
threonine (Thr) residues while N-linked oligosaccharides are attached to 
asparagine (Asn) residues when they are part of the sequence Asn-X- 
Ser/Thr, where X can be any amino acid except proline. X is preferably 

1 o one of the 19 naturally occurring amino acids other than proline. The 

structures of N-linked and O-linked oligosaccharides and the sugar 
residues found in each type are different. One type of sugar that is 
commonly found on both is N-acetymeuraminic acid (referred to as sialic 
acid). Sialic acid is usually the terminal residue of both N-linked and O- 
1 5 linked oligosaccharides and, by virtue of its negative charge, may confer 
acidic properties to the glycosylated compound. Such site(s) may be 
incorporated in the linker of the compounds of this invention and are 
preferably glycosylated by a cell during recombinant production of the 
polypeptide compounds (e.g., in mammalian cells such as CHO, BHK, 

2 0 COS). However, such sites may further be glycosylated by synthetic or 

semi-synthetic procedures known in the art. 

Other possible modifications include hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, 
oxidation of the sulfur atom in Cys, methylation of the alpha-amino 
2 5 groups of lysine, arginine, and histidine side chains. Creighton, Proteins: 
Structure and Molecule Properties (W. H. Freeman & Co., San Francisco), 

pp. 79-86 (1983). " 

Compounds of the present invention may be changed at the DNA 
level, as well. The DNA sequence of any portion of the compound may be 

6? 
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changed to codons more compatible with the chosen host cell. For E. coli, 
which is the preferred host cell, optimized codons are known in the art. 
Codons may be substituted to eliminate restriction sites or to include silent 
restriction sites, which may aid in processing of the DNA in the selected 
5 host cell. The vehicle, linker and peptide DNA sequences may be modified 
to include any of the foregoing sequence changes. 
Methods of Making 

The compounds of this invention largely may be made in 
transformed host cells using recombinant DNA techniques. To do so, a 

1 0 recombinant DNA molecule coding for the peptide is prepared. Methods 
of preparing such DNA molecules are well known in the art. For instance, 
sequences coding for the peptides could be excised from DNA using 
suitable restriction enzymes. Alternatively, the DNA molecule could be 
synthesized using chemical synthesis techniques, such as the 

1 5 phosphoramidate method. Also, a combination of these techniques could 
be used. 

The invention also includes a vector capable of expressing the 
peptides in an appropriate host. The vector comprises the DNA molecule 
that codes for the peptides operatively linked to appropriate expression 
2 0 control sequences. Methods of effecting this operative linking, either 
before or after the DNA molecule is inserted into the vector, are well 
known. Expression control sequences include promoters, activators, 
enhancers, operators, ribosomal binding sites, start signals, stop signals, 
cap signals, polyadenylation signals, and other signals involved with the 
2 5 control of transcription or translation. 

The resulting vector having the DNA molecule thereon is used to 
' transform an appropriate host. This transformation may be performed 
using methods well known in the art. 
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Any of a large number of available and well-known host cells may 
be used in the practice of this invention. The selection of a particular host 
is dependent upon a number of factors recognized by the art. These 
include, for example, compatibility with the chosen expression vector, 
5 toxicity of the peptides encoded by the DNA molecule, rate of 

transformation, ease of recovery of the peptides, expression characteristics, 
bio-safety and costs. A balance of these factors must be struck with the 
understanding that not all hosts may be equally effective for the 
expression of a particular DNA sequence. Within these general guidelines, 

1 0 useful microbial hosts include bacteria (such as E. coli sp.), yeast (such as 
Saccharomvces sp.) and other fungi, insects, plants, mammalian (including 
human) cells in culture, or other hosts known in the art. 

Next, the transformed host is cultured and purified. Host cells may 
be cultured under conventional fermentation conditions so that the 

1 5 desired compounds are expressed. Such fermentation conditions are well 
known in the art. Finally, the peptides are purified from culture by 
methods well known in the art. 

The compounds may also be made by synthetic methods. For 
example, solid phase synthesis techniques may be used. Suitable 

2 0 techniques are well known in the art, and include those described in 
Merrifield (1973), Chem. Polypeptides, pp. 335-61 (Katsoyannis and 
Panayotis eds.); Merrifield (1963), T. Am. Chem. Soc . 85: 2149; Davis etal. 
(1985), Biochem. Intl . 10: 394^14; Stewart and Young (1969), Solid Phase 
Peptide Synthesis; U.S. Pat. No. 3,941,763; Finn etal. (1976), The Proteins 

2 5 (3rd ed.) 2: 105-253; and Erickson etal. (1976), The Proteins (3rd ed.) 2: 
257-527. Solid phase synthesis is the preferred technique of making 
individual peptides since it is the most cost-effective method of making 
small peptides. 
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Compounds that contain derivatized peptides or which contain 
non-peptide groups may be synthesized by well-known organic chemistry 
techniques. 
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Uses of the Compounds 

In general . The compounds of this invention have pharmacologic 
activity resulting from their ability to bind to proteins of interest as 
agonists, mimetics or antagonists of the native ligands of such proteins of 
5 interest. The utility of specific compounds is shown in Table 2. The activity 
of these compounds can be measured by assays known in the art. For the 
TPO-mimetic and EPO-mimetic compounds, in vivo assays are further 
described in the Examples section herein. 

In addition to therapeutic uses, the compounds of the present 

1 o invention are useful in diagnosing diseases characterized by dysfunction 

of their associated protein of interest. In one embodiment, a method of 
detecting in a biological sample a protein of interest (e.g., a receptor) that 
is capable of being activated comprising the steps of: (a) contacting the 
sample with a compound of this invention; and (b) detecting activation of 
15 the protein of interest by the compound. The biological samples include 
tissue specimens, intact cells, or extracts thereof. The compounds of this 
invention may be used as part of a diagnostic kit to detect the presence of 
their associated proteins of interest in a biological sample. Such kits 
employ the compounds of the invention having an attached label to allow 

2 0 for detection. The compounds are useful for identifying normal or 

abnormal proteins of interest. For the EPO-mimetic compounds, for 
example, presence of abnormal protein of interest in a biological sample 
may be indicative of such disorders as Diamond Blackfan anemia, where it 
is believed that the EPO receptor is dysfunctional. 
2 5 Therapeutic uses of EPO-mimetic compounds . The EPO-mimetic 

compounds of the invention are useful for treating disorders characterized 
by low red blood cell levels. Included in the invention are methods of 
modulating the endogenous activity of an EPO receptor in a mammal, 
preferably methods of increasing the activity of an EPO receptor. In 

13 
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general, any condition treatable by erythropoietin, such as anemia, may 

also be treated by the EPOmimetic compounds of the invention. These - 

compounds are administered by an amount and route of delivery that is 
appropriate for the nature and severity of the condition being treated and 
5 may be ascertained by one skilled in the art. Preferably, administration is 
by injection, either subcutaneous, intramuscular, or intravenous. 

Therapeutic uses of TPO-mimetic compounds . FortheTPO- 
mimetic compounds, one can utilize such standard assays as those 
described in W095/26746 entitled "Compositions and Methods for 

1 0 Stimulating Megakaryocyte Growth and Differentiation". In vivo assays 
also appear in the Examples hereinafter. 

The conditions to be treated are generally those that involve an 
existing megakaryocyte/platelet deficiency or an expected 
megakaryocyte/platelet deficiency (e.g., because of planned surgery or 

1 5 platelet donation). Such conditions will usually be the result of a 

deficiency (temporary or permanent) of active Mpl ligand in vivo . The 
generic term for platelet deficiency is thrombocytopenia, and hence the 
methods and compositions of the present invention are generally available 
for treating thrombocytopenia in patients in need thereof. 

2 0 Thrombocytopenia (platelet deficiencies) may be present for 

various reasons, including chemotherapy and other therapy with a variety 
of drugs, radiation therapy, surgery, accidental blood loss, and other 
specific disease conditions. Exemplary specific disease conditions that 
involve thrombocytopenia and may be treated in accordance with this 

2 5 invention are: aplastic anemia, idiopathic thrombocytopenia, metastatic 
tumors which result in thrombocytopenia, systemic lupus erythematosus, 
splenomegaly, Fanconi's syndrome, vitamin B12 deficiency; folic acid 
deficiency, May-Hegglin anomaly, Wiskott-Aldrich syndrome, and 
paroxysmal nocturnal hemoglobinuria. Also, certain treatments for AIDS 
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result in thrombocytopenia (e.g., AZT). Certain wound healing disorders 
might also benefit from an increase in platelet numbers. 

With regard to anticipated platelet deficiencies, e.g., due to future 
surgery, a compound of the present invention could be administered 
5 several days to several hours prior to the need for platelets. With regard 
to acute situations, e.g., accidental and massive blood loss, a compound of 
this invention could be administered along with blood or purified 
platelets. 

The TPO-mimetic compounds of this invention may also be useful in 

1 o stimulating certain cell types other than megakaryocytes if such cells are found 

to express Mpl receptor. Conditions associated with such cells that express the 
Mpl receptor, which are responsive to stimulation by the Mpl ligand, are also 
within the scope of this invention. 

The TPO-mimetic compounds of this invention may be used in any 
1 5 situation in which production of platelets or platelet precursor cells is desired, 
or in which stimulation of the c-Mpl receptor is desired. Thus, for example, the 
compounds of this invention may be used to treat any condition in a mammal 
wherein there is a need of platelets, megakaryocytes, and the like. Such 
conditions are described in detail in the following exemplary sources: 

2 0 W095/26746; W095/21919; W095/18858; WO95/21920 and are incorporated 

herein. 

The TPO-mimetic compounds of this invention may also be useful in 
maintaining the viability or storage life of platelets and/or megakaryocytes and 
related cells. Accordingly, it could be useful to include an effective amount of 
2 5 one or more such compounds in a composition containing such cells. 

The therapeutic methods, compositions and compounds of the 
present invention may also be employed, alone or in combination with 
other cytokines, soluble Mpl receptor, hematopoietic factors, interleukins, 
growth factors or antibodies in the treatment of disease states 
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characterized by other symptoms as well as platelet deficiencies. It is 
anticipated that the inventive compound will prove useful in treating 
some forms of thrombocytopenia in combination with general stimulators 
of hematopoiesis, such as IL-3 or GM-CSF. Other megakaryocyte 
5 stimulatory factors, i.e., meg-CSF, stem cell factor (SCF), leukemia 
inhibitory factor (LIF), oncostatin M (OSM), or other molecules with 
megakaryocyte stimulating activity may also be employed with Mpl 
ligand. Additional exemplary cytokines or hematopoietic factors for such 
co-administration include IL-1 alpha, IL-1 beta, IL-2, IL-3, IL-4, IL-5, IL-6, 

1 0 IL-11, colony stimulating factor-1 (CSF-1), SCF, GM-CSF, granulocyte 
colony stimulating factor (G-CSF), EPO, interferon-alpha (IFN-alpha), 
consensus interferon, IFN-beta, or IFN-gamma. It may further be useful to 
administer, either simultaneously or sequentially, an effective amount of a 
soluble mammalian Mpl receptor, which appears to have an effect of 

1 5 causing megakaryocytes to fragment into platelets once the 

megakaryocytes have reached mature form. Thus, administration of an 
inventive compound (to enhance the number of mature megakaryocytes) 
followed by administration of the soluble Mpl receptor (to inactivate the 
ligand and allow the mature megakaryocytes to produce platelets) is 

2 0 expected to be a particularly effective means of stimulating platelet 

production. The dosage recited above would be adjusted to compensate 
for such additional components in the therapeutic composition. Progress 
of the treated patient can be monitored by conventional methods. 

In cases where the inventive compounds are added to compositions 

25 of platelets and/or megakaryocytes and related cells, the amount to be 
included will generally be ascertained experimentally by techniques and 
assays known in the art. An exemplary range of amounts is~0.1 jig — 1 mg 
inventive compound per 10 6 cells. 
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Pharmaceutical Compositions 

In General . The present invention also provides methods of using 
pharmaceutical compositions of the inventive compounds. Such 
pharmaceutical compositions may be for administration for injection, or for 
5 oral, pulmonary, nasal, transdermal or other forms of administration. In 
general, the invention encompasses pharmaceutical compositions comprising 
effective amounts of a compound of the invention together with 
pharmaceutically acceptable diluents, preservatives, solubilizers, emulsifiers, 
adjuvants and/or carriers. Such compositions include diluents of various 

1 0 buffer content (e.g., Tris-HCl, acetate, phosphate), pH and ionic strength; 
additives such as detergents and solubilizing agents (e.g., Tween 80, 
Polysorbate 80), antioxidants (e.g., ascorbic acid, sodium metabisulfite), 
preservatives (e.g., Thimersol, benzyl alcohol) and bulking substances (e.g., 
lactose, mannitol); incorporation of the material into particulate preparations of 

1 5 polymeric compounds such as polylactic acid, polyglycolic acid, etc. or into 
liposomes. Hyaluronic acid may also be used, and this may have the effect of 
promoting sustained duration in the circulation. Such compositions may 
influence the physical state, stability, rate of in vivo release, and rate of in vivo 
clearance of the present proteins and derivatives. See, e.g., Remington's 

2 0 Pharmaceutical Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, PA 
18042) pages 1435-1712 which are herein incorporated by reference. The 
compositions may be prepared in liquid form, or may be in dried powder, such 
as lyophilized form. Implantable sustained release formulations are also 
contemplated, as are transdermal formulations. 

25 Oral dosage forms . Contemplated for use herein are oral solid 

dosage forms, which are described generally in Chapter 89 of Remington's 
Pharmaceutical Sciences (1990), 18th Ed., Mack Publishing Co. Easton PA - 
18042, which is herein incorporated by reference. Solid dosage forms 
include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also, 
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liposomal or proteinoid encapsulation may be used to formulate the 
present compositions (as, for example, proteinoid microspheres reported _ 
in U.S. Patent No. 4,925,673). Liposomal encapsulation may be used and 
the liposomes may be derivatized with various polymers (e.g., U.S. Patent 
5 No. 5,013,556). A description of possible solid dosage forms for the 

therapeutic is given in Chapter 10 of Marshall, K., Modern Pharmaceutics 
(1979), edited by G. S. Banker and C. T. Rhodes, herein incorporated by 
r reference. In general, the formulation will include the inventive 
compound, and inert ingredients which allow for protection against the 
1 0 stomach environment, and release of the biologically active material in the 
intestine. 

Also specifically contemplated are oral dosage forms of the above 
inventive compounds. If necessary, the compounds may be chemically 
modified so that oral delivery is efficacious. Generally, the chemical 

1 5 modification contemplated is the attachment of at least one moiety to the 
compound molecule itself, where said moiety pennits (a) inhibition of 
proteolysis; and (b) uptake into the blood stream from the stomach or 
intestine. Also desired is the increase in overall stability of the compound 
and increase in circulation time in the body. Moieties useful as covalently 

2 0 attached vehicles in this invention may also be used for this purpose. 
Examples of such moieties include: PEG, copolymers of ethylene glycol 
and propylene glycol, carboxymethyl cellulose, dextran, polyvinyl alcohol, 
polyvinyl pyrrolidone and polyproline. See, for example, Abuchowski and 
Davis, Soluble Polymer-Enzyme Adducts, Enzymes as Drugs (1981), 

2 5 Hocenberg and Roberts, eds., Wiley-Interscience, New York, NY, , pp 367- 
83; Newmark, etal. (1982), T. AppL Biochem . 4:185-9. Other polymers that 
could be used are poly-l,3-dioxolane and poly-l,3,6-tioxocane. Preferred 
for pharmaceutical usage, as indicated above, are PEG moieties. 
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For oral delivery dosage forms, it is also possible to use a salt of a 
modified aliphatic amino acid, such as sodium N-(8-[2-hydroxybenzoyl] 
amino) caprylate (SNAC), as a carrier to enhance absorption of the 
therapeutic compounds of this invention. The clinical efficacy of a heparin 
5 formulation using SNAC has been demonstrated in a Phase II trial 

conducted by Emisphere Technologies. See US Patent No. 5,792,451, "Oral 
drug delivery composition and methods". 

The compounds of this invention can be included in the 
formulation as fine multiparticulates in the form of granules or pellets of 

1 0 particle size about 1 mm. The formulation of the material for capsule 
administration could also be as a powder, lightly compressed plugs or 
even as tablets. The therapeutic could be prepared by compression. 

Colorants and flavoring agents may all be included. For example, 
the protein (or derivative) may be formulated (such as by liposome or 

1 5 microsphere encapsulation) and then further contained within an edible 
product, such as a refrigerated beverage containing colorants and 
flavoring agents. 

One may dilute or increase the volume of the compound of the 
invention with an inert material. These diluents could include 

2 0 carbohydrates, especially mannitol, a-lactose, anhydrous lactose, cellulose, 
sucrose, modified dextrans and starch. Certain inorganic salts may also be 
used as fillers including calcium triphosphate, magnesium carbonate and 
sodium chloride. Some commercially available diluents are Fast-Flo, 
Emdex, STA-Rx 1500, Emcompress and Avicell. 

2 5 Disintegrants may be included in the formulation of the therapeutic 

into a solid dosage form. Materials used as disintegrants include but are 
not limited to starch including the commercial disintegranrtased on 
starch, Explotab. Sodium starch glycolate, Amberlite, sodium 
carboxymethylcellulose, ultramylopectin, sodium alginate, gelatin, orange 

79 
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peel, add carboxymethyl cellulose, natural sponge and bentonite may all 

be used. Another form of the disintegrants are the insoluble cationic 

exchange resins. Powdered gums may be used as disintegrants and as 
binders and these can include powdered gums such as agar, Karaya or 
5 tragacanth. Alginic acid and its sodium salt are also useful as 
disintegrants. 

Binders may be used to hold the therapeutic agent together to form 
a hard tablet and include materials from natural products such as acacia, 
tragacanth, starch and gelatin. Others include methyl cellulose (MC), ethyl 

10 cellulose (EC) and carboxymethyl cellulose (CMC). Polyvinyl pyrrolidone 
(PVP) and hydroxypropylmethyl cellulose (HPMC) could both be used in 
alcoholic solutions to granulate the therapeutic. 

An antifrictional agent may be included in the formulation of the 
therapeutic to prevent sticking during the formulation process. Lubricants 

1 5 may be used as a layer between the therapeutic and the die wall, and these 
can include but are not limited to; stearic add induding its magnesium 
and calcium salts, polytetrafluoroethylene (PTFE), liquid paraffin, 
vegetable oils and waxes. Soluble lubricants may also be used such as 
sodium lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of 

2 0 various molecular weights, Carbowax 4000 and 6000. 

Glidants that might improve the flow properties of the drug during 
formulation and to aid rearrangement during compression might be 
added. The glidants may indude starch, talc, pyrogenic silica and 
hydrated silicoaluminate. 

25 To aid dissolution of the compound of this invention into the 

aqueous environment a surfactant might be added as a wetting agent. 
Surfactants may indude anionic detergents such as sodiumiauryl sulfate, 
dioctyl sodium sulfosuccinate and dioctyl sodium sulfonate. Cationic 
detergents might be used and could indude benzalkonium chloride or 

8D 
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benzethonium chloride. The list of potential nonionic detergents that 
could be included in the formulation as surfactants are lauromacrogol 400, 
polyoxyl 40 stearate, polyoxyethylene hydrogenated castor oil 10, 50 and 
60, glycerol monostearate, polysorbate 40, 60, 65 and 80, sucrose fatty acid 
5 ester, methyl cellulose and carboxymethyl cellulose. These surfactants 
could be present in the formulation of the protein or derivative either 
alone or as a mixture in different ratios. 

Additives may also be included in the formulation to enhance 
uptake of the compound. Additives potentially having this property are 

1 0 for instance the fatty acids oleic acid, linoleic acid and linolenic acid. 

Controlled release formulation may be desirable. The compound of 
this invention could be incorporated into an inert matrix which permits 
release by either diffusion or leaching mechanisms e.g., gums. Slowly 
degenerating matrices may also be incorporated into the formulation, e.g., 

1 5 alginates, polysaccharides. Another form of a controlled release of the 

compounds of this invention is by a method based on the Oros therapeutic 
system (Alza Corp.), i.e., the drug is enclosed in a semipermeable 
membrane which allows water to enter and push drug out through a 
single small opening due to osmotic effects. Some enteric coatings also 

2 0 have a delayed release effect. 

Other coatings may be used for the formulation. These include a 
variety of sugars which could be applied in a coating pan. The therapeutic 
agent could also be given in a film coated tablet and the materials used in 
this instance are divided into 2 groups. The first are the nonenteric 

2 5 materials and include methyl cellulose, ethyl cellulose, hydroxyethyl 
cellulose, methylhydroxy-ethyl cellulose, hydroxypropyl cellulose, 
hydrdxypropyl-methyl cellulose, sodium carboxy-methyl cellulose, 
providone and the polyethylene glycols. The second group consists of the 
enteric materials that are commonly esters of phthalic acid. 
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A mix of materials might be used to provide the optimum film 
coating. Film coating may be carried out in a pan coater or in a fluidized 
bed or by compression coating. 

Pulmonary delivery forms . Also contemplated herein is pulmonary 
5 delivery of the present protein (or derivatives thereof). The protein (or 
derivative) is delivered to the lungs of a mammal while inhaling and 
traverses across the lung epithelial lining to the blood stream. (Other 
reports of this include Adjei etal., Pharma. Res . (1990) 7: 565-9; Adjei etal. 
(1990), Internatl. I. Pharmaceutics 63: 135-44 (leuprolide acetate); Braquet 

1 0 etal. (1989), T. Cardiovasc. Pharmacol . 13 (suppl.5): s.143-146 (endothelin- 
1); Hubbard etal. (1989), Annals Int. Med . 3: 206-12 (al-antitrypsin); Smith 
etal. (1989), T. Clin. Invest . 84: 1145-6 (al-proteinase); Oswein etal. (March 
1990), "Aerosolization of Proteins", Proc. Svmp. R esp. Drug Delivery H, 
Keystone, Colorado (recombinant human growth hormone); Debs etal . 

1 5 (1988), T. Immunol . 140: 3482-8 (interferon-y and tumor necrosis factor a) 
and Platz etal., U.S. Patent No. 5,284,656 (granulocyte colony stimulating 
factor). 

Contemplated for use in the practice of this invention are a wide 
range of mechanical devices designed for pulmonary delivery of 

2 0 therapeutic products, including but not limited to nebulizers, metered 
dose inhalers, and powder inhalers, all of which are familiar to those 
skilled in the art. Some specific examples of commercially available 
devices suitable for the practice of this invention are the Ultravent 
nebulizer, manufactured by Mallinckrodt, Inc., St. Louis, Missouri; the 

2 5 Acorn II nebulizer, manufactured by Marquest Medical Products, 

Englewood, Colorado; the Ventolin metered dose inhaler, manufactured 
by Glaxo Inc., Research Triangle Park, North Carolina; ahdTthe Spinhaler 
powder inhaler, manufactured by Fisons Corp., Bedford, Massachusetts. 
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All such devices require the use of formulations suitable for the 
dispensing of the inventive compound. Typically, each formulation is 
specific to the type of device employed and may involve the use of an 
appropriate propellant material, in addition to diluents, adjuvants 
5 and/or carriers useful in therapy. 

The inventive compound should most advantageously be 
prepared in particulate form with an average particle size of less than 10 
^m (or microns), most preferably 0.5 to 5 urn, for most effective delivery 
to the distal lung. 

1 o Pharmaceutical^ acceptable carriers include carbohydrates such 

as trehalose, mannitol, xylitol, sucrose, lactose, and sorbitol. Other 
ingredients for use in formulations may include DPPC, DOPE, DSPC and 
DOPC Natural or synthetic surfactants may be used. PEG may be used 
(even apart from its use in derivatizing the protein or analog). Dextrans, 
1 5 such as cyclodextran, may be used. Bile salts and other related enhancers 
may be used. Cellulose and cellulose derivatives may be used. Amino 
acids may be used, such as use in a buffer formulation. 

Also, the use of liposomes, microcapsules or microspheres, 
inclusion complexes, or other types of carriers is contemplated. 

2 0 Formulations suitable for use with a nebulizer, either jet or 

ultrasonic, will typically comprise the inventive compound dissolved in 
water at a concentration of about 0.1 to 25 mg of biologically active protein 
per mL of solution. The formulation may also include a buffer and a 
simple sugar (e.g., for protein stabilization and regulation of osmotic 
2 5 pressure). The nebulizer formulation may also contain a surfactant, to 
reduce or prevent surface induced aggregation of the protein caused by 
atomization of the solution in forming the aerosol. 

Formulations for use with a metered-dose inhaler device will 
generally comprise a finely divided powder containing the inventive 
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compound suspended in a propellant with the aid of a surfactant. The 
propellant may be any conventional material employed for this purpose, 
such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a 
hydrofluorocarbon, or a hydrocarbon, including trichlorofluoromethane, 
5 dichlorodifluoromethane, dichlorotetrafluoroethanol, and 1,1,1,2- 

tetrafluoroethane, or combinations thereof. Suitable surfactants include 
sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a 
surfactant. 

Formulations for dispensing from a powder inhaler device will 

1 0 comprise a finely divided dry powder containing the inventive compound 
and may also include a bulking agent, such as lactose, sorbitol, sucrose, 
mannitol, trehalose, or xylitol in amounts which facilitate dispersal of the 
powder from the device, e.g., 50 to 90% by weight of the formulation. 

Nasal delivery forms . Nasal delivery of the inventive compound is 

1 5 also contemplated. Nasal delivery allows the passage of the protein to the 
blood stream directly after administering the therapeutic product to the 
nose, without the necessity for deposition of the product in the lung. 
Formulations for nasal delivery include those with dextran or 
cyclodextran. Delivery via transport across other mucous membranes is 

2 0 also contemplated. 

Dosages . The dosage regimen involved in a method for treating the 
above-described conditions will be determined by the attending physician, 
considering various factors which modify the action of drugs, e.g. the age, 
condition, body weight, sex and diet of the patient, the severity of any infection, 

2 5 time of administration and other clinical factors. Generally, the daily regimen 
should be in the range of 0.1-1000 micrograms of the inventive compound per 
kilogram of body weight, preferably 0.1-150 micrograms per kilogram. 
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Specific preferred embodiments 

The inventors have determined preferred peptide sequences for 
molecules having many different kinds of activity. The inventors have 
further determined preferred structures of these preferred peptides 
5 combined with preferred linkers and vehicles. Preferred structures for 
these preferred peptides listed in Table 21 below. 

Table 21 — Preferred embodiments 



Sequence/structure 


SEQ 
ID 


Activity 


F'-(G) r IEGPTLRQWLAARA-(G)„-IEGPTLRQWLAARA 


337 


TPO-mimetic 


lEGPTLRQWLAARA-tG^-IEGPTLRQWLAARA-tG),;- F 


338 


TPO-mimetic 


F'-(G) 5 -IEGPTLRQWI_AARA 


1032 


TPO-mimetic 


IEGPTLRQWLAARA -(G),- F' 


ivoo 


TPO-mimetic 


F 1 -(G) 5 -GGTYSCHFGPLTWVCKPQGG-(G) 4 - 
GGTYSCHFGPLTWVCKPQGG 


339 


EPO-mimetic 


GGTYSCHFGPLTWVCKPQGG-(G) 4 - 
GGTYSCHFGPLTWVCKPQGG-(G),-F' 


340 


EPO-mimetic 


GGTYSCHFGPLTWVCKPQGG-(G) 5 -F 


1034 


EPO-mimetic 


F'-(G) 5 -DFLPHYKNTSLGHRP 


1045 


TNF-a inhibitor 


DFLPHYKIMTSLGHRP-(G),-F 1 


1046 


TNF-a inhibitor 


F'-(G) 5 - FEWTPGYWQPYALPL 


1047 


IL-1 R antagonist 


FEWTPGYWQPYALPL-(G) 5 -F' 


1048 


IL-1 R antagonist 


F 1 -(G) S -VEPNCDIHVMWEWECFERL 


1049 


VEGF-antagonist 


VEPNCDIHVMWEWECFERL-(G) 5 -F' 


1050 


VEGF-antagonist 


F'-(G) 5 -CTTHWGFTLC 


1051 


MMP inhibitor 


CTTHWG FTLC-(G) 5 -F' 


1052 


MMP inhibitor 



"F" is an Fc domain as defined previously herein. 



Working examples 
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The compounds described above may be prepared as described 
below. These examples comprise preferred embodiments of the invention _ 
and are illustrative rather than limiting. 

Example 1 

5 TPO-Mimetics 

The following example uses peptides identified by the numbers 
appearing in Table A hereinafter. 

Preparation of peptide 19 . Peptide 17b (12 mg) and MeO-PEG-SH 
5000 (30 mg, 2 equiv.) were dissolved in 1 ml aqueous buffer (pH 8). The 

1 0 mixture was incubated at RT for about 30 minutes and the reaction was 
checked by analytical HPLC, which showed a > 80% completion of the 
reaction. The pegylated material was isolated by preparative HPLC. 

Preparation of peptide 20 . Peptide 18 (14 mg) and MeO-PEG- 
maleimide (25 mg) were dissolved in about 1.5 ml aqueous buffer (pH 8). 

1 5 The mixture was incubated at RT for about 30 minutes, at which time 
about 70% transformation was complete as monitored with analytical 
HPLC by applying an aliquot of sample to the HPLC column. The 
pegylated material was purified by preparative HPLC. 

Bioactivitv assay . The TPO in vitro bioassay is a mitogenic assay 

2 0 utilizing an IL-3 dependent clone of murine 32D cells that have been 

transf ected with human mpl receptor. This assay is described in greater 
detail in WO 95/26746. Cells are maintained in MEM medium containing 
10% Fetal Clone II and 1 ng/ml mIL-3. Prior to sample addition, cells are 
prepared by rinsing twice with growth medium lacking mIL-3. An 

2 5 extended twelve point TPO standard curve is prepared, ranging from 33 
to 39 pg/ml. Four dilutions, estimated to fall within the linear portion of 
the standard curve, (100 to 125 pg/ml), are prepared for eadi sample and 
run in triplicate. A volume of 100 jil of each dilution of sample or 
standard is added to appropriate wells of a 96 well microtiter plate 



WO 00/24782 



PCT/US99/25044 



containing 10,000 cells/well. After forty-four hours at 37 °C and 10% C0 2 , 
MTS (a tetrazolium compound which is bioreduced by cells to a fonnazan) 
is added to each well. Approximately six hours later, the optical density is 
read on a plate reader at 490 ran. A dose response curve (log TPO 
5 concentration vs. O.D.- Background) is generated and linear regression 
analysis of points which fall in the linear portion of the standard curve is 
performed. Concentrations of unknown test samples are determined 
using the resulting linear equation and a correction for the dilution factor. 
TMP tandem repeats with polyglycine linkers . Our design of 

1 0 sequentially linked TMP repeats was based on the assumption that a 

dimeric form of TMP was required for its effective interaction with c-Mpl 
(the TPO receptor) and that depending on how they were wound up 
against each other in the receptor context, the two TMP molecules could 
be tethered together in the C- to N-terminus configuration in a way that 

1 5 would not perturb the global dimeric conformation. Clearly, the success 
of the design of tandem linked repeats depends on proper selection of the 
length and composition of the linker that joins the C- and N-termini of the 
two sequentially aligned TMP monomers. Since no structural information 
of the TMP bound to c-Mpl was available, a series of repeated peptides 

2 0 with linkers composed of 0 to 10 and 14 glycine residues (Table A) were 
synthesized. Glycine was chosen because of its simplicity and flexibility, 
based on the rationale that a flexible polyglycine peptide chain might 
allow for the free folding of the two tethered TMP repeats into the 
required conformation, while other amino acid sequences may adopt 

2 5 undesired secondary structures whose rigidity might disrupt the correct 
packing of the repeated peptide in the receptor context. 

The resulting peptides are readily accessible by conventional solid 
phase peptide synthesis methods (Merrifield (1963), ]. Amer. Chem. Soc. 
85: 2149) with either Fmoc or t-Boc chemistry. Unlike the synthesis of the 

%1 
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C-terminally linked parallel dimer which required the use of an 
orthogonally protected lysine residue as the initial branch point to build 
the two peptide chains in a pseudosymmetrical way (Cwirla et al . (1997), 
Science 276: 1696-9), the synthesis of these tandem repeats was a 
5 straightforward, stepwise assembly of the continuous peptide chains from 
the C- to N-terminus. Since dimerization of TMP had a more dramatic 
effect on the proliferative activity than binding affinity as shown for the C- 
terminal dimer (Cwirla etal (1997)), the synthetic peptides were tested 
directly for biological activity in a TPO-dependent cell-proliferation assay 

1 0 using an IL-3 dependent clone of murine 32D cells transfected with the 
fulHength c-Mpl (Palacios etal.,. Cell 41:727 (1985)). As the test results 
showed, all the polyglycine linked tandem repeats demonstrated >1000 
fold increases in potency as compared to the monomer, and were even 
more potent than the C-terminal dimer in this cell proliferation assay. The 

1 5 absolute activity of the C-terminal dimer in our assay was lower than that 
of the native TPO protein, which is different from the previously reported 
findings in which the C-terminal dimer was found to be as active as the 
natural ligand (Cwirla etal. (1997)). This might be due to differences in 
the conditions used in the two assays. Nevertheless, the difference in 

2 0 activity between tandem (C terminal of first monomer linked to N 

terminal of second monomer) and C-terminal (C terminal of first monomer 
linked to C terminal of second monomer; also referred to as parallel) 
dimers in the same assay clearly demonstrated the superiority of tandem 
repeat strategy over parallel peptide dimerization. It is interesting to note 

2 5 that a wide range of length is tolerated by the linker. The optimal linker 
between tandem peptides with the selected TMP monomers apparently is 
composed of 8 glycines. *~ 

Other tandem repeats . Subsequent to this first series of TMP 
tandem repeats, several other molecules were designed either with 
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different linkers or containing modifications within the monomer itself. 
The first of these molecules, peptide 13, has a linker composed of GPNG, a 
sequence known to have a high propensity to form a p-turn-type 
secondary structure. Although still about 100-fold more potent than the 
5 monomer, this peptide was found to be >10-fold less active than the 
equivalent GGGG-linked analog. Thus, introduction of a relatively rigid 
p-turn at the linker region seemed to have caused a slight distortion of the 
optimal agonist conformation in this short linker form. 

The Trp9 in the IMP sequence is a highly conserved residue among 

10 the active peptides isolated from random peptide libraries. There is also a 
highly conserved Trp in the consensus sequences of EPO mimetic peptides 
and this Trp residue was f ound to be involved in the formation of a 
hydrophobic core between the two EMPs and contributed to hydrophobic 
interactions with the EPO receptor. Livnah etal. (1996), Science 273: 464- 

15 71). By analogy, the Trp9 residue in TMP might have a similar function in 
dimerization of the peptide ligand, and as an attempt to modulate and 
estimate the effects of noncovalent hydrophobic forces exerted by the two 
indole rings, several analogs were made resulting from mutations at the 
Trp. So in peptide 14, the Trp residue was replaced in each of the two 

2 0 TMP monomers with a Cys, and an intramolecular disulfide bond was 
formed between the two cysteines by oxidation which was envisioned to 
mimic the hydrophobic interactions between the two Trp residues in 
peptide dimerization. Peptide 15 is the reduced form of peptide 14. In 
peptide 16, the two Trp residues were replaced by Ala. As the assay data 

2 5 show, all three analogs were inactive. These data further demonstrated 
that Trp is critical for the activity of the TPO mimetic peptide, not just for 

dimer formation. 

The next two peptides (peptide 17a, and 18) each contain in their 8- 
amino acid linker a Lys or Cys residue. These two compounds are 

*9 
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precursors to the two PEGylated peptides (peptide 19 and 20) in which the 

side chain of the Lys or Cys is modified by a PEG moiety. A PEG moiety 

was introduced at the middle of a relatively long linker, so that the large 
PEG component (5 kDa) is far enough away from the critical binding sites 
5 in the peptide molecule. PEG is a known biocompatible polymer which is 
increasingly used as a covalent modifier to improve the pharmacokinetic 
profiles of peptide- and protein-based therapeutics. 

A modular, solution-based method was devised for convenient 
PEGylation of synthetic or recombinant peptides. The method is based on 

10 the now well established chemoselective ligation strategy which utilizes 
the specific reaction between a pair of mutually reactive functionalities. 
So, for pegylated peptide 19, the lysine side chain was preactivated with a 
bromoacetyl group to give peptide 17b to accommodate reaction with a 
thiol-derivatized PEG. To do that, an orthogonal protecting group, Dde, 

1 5 was employed for the protection of the lysine e-amine. Once the whole 
peptide chain was assembled, the N-terminal amine was reprotected with 
t-Boc. Dde was then removed to allow for the bromoacetylation. This 
strategy gave a high quality crude peptide which was easily purified using 
conventional reverse phase HPLC. Ligation of the peptide with the thiol- 

2 0 modified PEG took place in aqueous buffer at pH 8 and the reaction 
completed within 30 minutes. MALDI-MS analysis of the purified, 
pegylated material revealed a characteristic, bell-shaped spectrum with an 
increment of 44 Da between the adjacent peaks. For PEG-peptide 20, a 
cysteine residue was placed in the linker region and its side chain thiol 

2 5 group would serve as an attachment site for a maleimide-containing PEG. 
Similar conditions were used for the pegylation of this peptide. As the 
assay data revealed, these two pegylated peptides had evefT higher in vitro - 
bioactivity as compared to their unpegylated counterparts. 
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Peptide 21 has in its 8-amino acid linker a potential glycosylation 
motif, NGS. Since our exemplary tandem repeats are made up of natural 
amino acids linked by peptide bonds, expression of such a molecule in an 
appropriate eukaryotic cell system should produce a glycopeptide with 
5 the carbohydrate moiety added on the side chain carboxyamide of Asn. 
Glycosylation is a common post-translational modification process which 
can have many positive impacts on the biological activity of a given 
protein by increasing its aqueous solubility and in vivo stability. As the 
assay data show, incorporation of this glycosylation motif into the linker 

1 0 maintained high bioactivity . The synthetic precursor of the potential 
glycopeptide had in effect an activity comparable to that of the -(G) 8 - 
linked analog. Once glycosylated, this peptide is expected to have the 
same order of activity as the pegylated peptides, because of the similar 
chemophysical properties exhibited by a PEG and a carbohydrate moiety. 

1 5 The last peptide is a dimer of a tandem repeat. It was prepared by 

oxidizing peptide 18, which formed an intermolecular disulfide bond 
between the two cysteine residues located at the linker. This peptide was 
designed to address the possibility that TMP was active as a tetramer. The 
assay data showed that this peptide was not more active than an average 

2 0 tandem repeat on an adjusted molar basis, which indirectly supports the 
idea that the active form of TMP is indeed a dimer, otherwise dimerization 
of a tandem repeat would have a further impact on the bioactivity. 

In order to confirm the in vitro data in animals, one pegylated TMP 
tandem repeat (compound 20 in Table A) was delivered subcutaneously to 

2 5 normal mice via osmotic pumps. Time and dose-dependent increases 

were seen in platelet numbers for the duration of treatment. Peak platelet 
levels over 4-fold baseline were seen on day 8. A dose of 1(T ^g/kg/ day of 
the pegylated TMP repeat produced a similar response to rHuMGDF 
(non-pegylated) at 100 Mg/kg/day delivered by the same route. 

1/ 
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Table A— TPO-mimetic Peptides 



Peptide 
No. 



Compound 



SEQID Relative 
NO: Potency 



TMP-(G)„ 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17a 

17b 

18 

19 

20 

21 

22 



TPO 

TMP monomer 
TMP C-C dimer 
•TMP: 
n = 0 
n = 1 
n = 2 
n = 3 
n = 4 
n = 5 
n = 6 
n = 7 
n = 8 
n = 9 
n = 10 
n = 14 

TMP-GPNG-TMP 

IEGPTLRQCLAARA-GGGGGGGG-IEGPTLRQCLAARA 
I 1 

(cyclic) 

IEG PTLRQCLAAR A-GGGGGGGG- 

lEGPTLRQfiLAARA (linear) 

IEGPTLRQAJ-AARA-GGGGGGGG- 

IEGPTLRQALAARA 

TMP-GGGKGGGG-TMP 

TMP-GGGK(BrAc)GGGG-TMP 

TMP-GGGCGGGG-TMP 

TMP-GGGK(PEG)GGGG-TMP 

TMP-GGGC(PEG)GGGG-TMP 

TMP-GGGN*GSGG-TMP 

TMP-GGGCGGGG-TMP 
I 

TMP-GGGCGGGG-TMP 



13 



341 
342 
343 
344 
345 
346 
347 
348 
349 
350 
351 
352 
353 
354 

355 

356 

357 
358 
359 
360 
361 
362 
363- 

363 



+ 
+++- 

++++- 



++++ 
++++ 
++++ 
++++ 



++++ 



+++ 



ND 



+-H-++ 
+++++ 
++++ 

++++ 
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Discussion . It is well accepted that MGDF acts in a way similar to 
hGH, i.e., one molecule of the protein ligand binds two molecules of the 
receptor for its activation. Wells etal.(1996), Ann. Rev. Biochem . 65: 609- 
34. Now, this interaction is mimicked by the action of a much smaller 
5 peptide, TMP. However, the present studies suggest that this mimicry 
requires the concerted action of two TMP molecules, as covalent 
dimerization of TMP in either a C-C parallel or C-N sequential fashion 
increased the in vitro biological potency of the original monomer by a 
factor of greater than 10 3 . The relatively low biopotency of the monomer is 
1 0 probably due to inefficient formation of the noncovalent dimer. A 

preformed covalent repeat has the ability to eliminate the entropy barrier 
for the formation of a noncovalent dimer which is exclusively driven by 
weak, noncovalent interactions between two molecules of the small, 14- 
residue peptide. 

15 It is intriguing that this tandem repeat approach had a similar effect 

on enhancing bioactivity as the reported C-C dimerization is intriguing. 
These two strategies brought about two very different molecular 
configurations. The C-C dimer is a quasi-symmetrical molecule, while the 
tandem repeats have no such symmetry in their linear structures. Despite 

20 this difference in their primary structures, these two types of molecules 
appeared able to fold effectively into a similar biologically active 
conformation and cause the dimerization and activation of c-Mpl. These 
experimental observations provide a number of insights into how the two 
TMP molecules may interact with one another in binding to c-Mpl. First, 

25 the two C-termini of the two bound TMP molecules must be in relatively 
close proximity with each other, as suggested by data on the C-terminal 
dimer. Second, the respective N- and C-termini of the two TMP molecules - 
in the receptor complex must also be very closely aligned with each other, 
such that they can be directly tethered together with a single peptide bond 
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to realize the near maximum activity-enhancing effect brought about by 
the tandem repeat strategy. Insertion of one or more (up to 14) glycine 
residues at the junction did not increase (or decrease) significantly the 
activity any further. This may be due to the fact that a flexible polyglycine 
5 peptide chain is able to loop out easily from the junction without causing 
any significant changes in the overall conformation. This flexibility seems 
to provide the freedom of orientation for the TMP peptide chains to fold 
into the required conformation in interacting with the receptor and 
validate it as a site of modification. Indirect evidence supporting this 

1 0 came from the study on peptide 13, in which a much more rigid b-turn- 
forming sequence as the linker apparently forced a deviation of the 
backbone alignment around the linker which might have resulted in a 
slight distortion of the optimal conformation, thus resulting in a moderate 
(10-fold) decrease in activity as compared with the analogous compound 

1 5 with a 4-Gly linker. Third, Trp9 in TMP plays a similar role as Trpl3 in 
EMP, which is involved not only in peptide:peptide interaction for the 
formation of dimers but also is important for contributing hydrophobic 
forces in peptideireceptor interaction. Results obtained with the W to C 
mutant analog, peptide 14, suggest that a covalent disulfide linkage is not 

2 0 sufficient to approximate the hydrophobic interactions provided by the 
Trp pair and that, being a short linkage, it might bring the two TMP 
monomers too close, therefore perturbing the overall conformation of the 
optimal dimeric structure. 

An analysis of the possible secondary structure of the TMP peptide 

2 5 can provide further understanding on the interaction between TMP and c- 
Mpl. This can be facilitated by making reference to the reported structure 
of the EPO mimetic peptide. Livnah etal. (1996), Science 273:464-75 The 
receptor-bound EMP has a b-hairpin structure with a b-turn formed by the 
highly consensus Gly-Pro-Leu-Thr at the center of its sequence. Instead of 
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GPLT, TMP has a highly selected GPTL sequence which is likely to form a 
similar turn. However, this turn-like motif is located near the N-terminal 
part in TMP. Secondary structure prediction using Chau-Fasman method 
suggests that the C-terminal half of the peptide has a tendency to adopt a 
5 helical conformation. Together with the highly conserved Trp at position 
9, this C-terminal helix may contribute to the stabilization of the dimeric 
structure. It is interesting to note that most of our tandem repeats are 
more potent than the C-terminal parallel dimer. Tandem repeats seem to 
give the molecule a better fit conformation than does the C-C parallel 

1 0 dimerization. The seemingly asymmetric feature of a tandem repeat 

might have brought it closer to the natural ligand which, as an asymmetric 
molecule, uses two different sites to bind two identical receptor molecules. 

Introduction of a PEG moiety was envisaged to enhance the in vivo 
activity of the modified peptide by providing it a protection against 

1 5 proteolytic degradation and by slowing down its clearance through renal 
filtration. It was unexpected that pegylation could further increase the in 
vitro bioactivity of a tandem repeated TMP peptide in the cell-based 
proliferation assay. 

Example 2 

20 Fe-TMP fusions 

TMPs (and EMPs as described in Example 3) were expressed in 
either monomelic or dimeric form as either N-terminal or C-terminal 
fusions to the Fc region of human IgGl. In all cases, the expression 
construct utilized the luxPR promoter promoter in the plasmid expression 

25 vector pAMG21. 

Fc-TMP. A DNA sequence coding for the Fc region of human IgGl 
fused in-frame to a monomer of the TPOmimetic peptide was constructed - 
using standard PCR technology. Templates for PCR reactions were the 
pFc-A3 vector and a synthetic TMP gene. The synthetic gene was 



WO 00/24782 



PCT/US99/25044 



constructed from the 3 overlapping oligonucleotides (SEQ ID NOS: 364, 
365/ and 366, respectively) shown below: 

1842-97 AAA AAA GGA TCC TCG AGA TTA AGC ACG AGC AGC CAG CCA 

CTG ACG CAG AGT CGG ACC 

5 

1842-98 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG ACT CTG CGT 

1842-99 CAG TGG CTG GCT GCT CGT GCT TAA TCT CGA GGA TCC TTT 

TTT 

10 

These oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 367 and 368, respectively) shown 
below: 

1 5 AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 

1 + + + + + + 60 

CCAGGCTGAGACGCAGTCACCGACCGACGAGCACGA 
a KGGGG GI EGPTLRQWLAARA 

20 TAATCTCGAGGATCCTTTTTT 

61 + + - 81 

ATTAGAGCTCCTAGGAAAAAA 
a * 

2 5 This duplex was amplified in a PCR reaction using 1842-98 and 1842-97 as 

the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers shown below (SEQ ID NOS: 369 and 370): 

30 1216-52 AAC ATA AGT ACC TGT AGG ATC G 

1830-51 TTCGATACCA CCACCTCCAC CTTTACCCGG AGACAGGGAG AGGCTCTTCTGC 

The oligonucleotides 1830-51 and 1842-98 contain an overlap of 24 

3 5 nucleotides, allowing the two genes to be fused together in the correct 

reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1842-97. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamHI, and then ligated 

4 0 into the vector pAMG21 and transformed into competent E. coli strain 

2596 cells as described for EMP-Fc herein. Clones were screened for the 
ability to produce the recombinant protein product and to possess the 
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gene fusion having the correct nucleotide sequence. A single such done 
was selected and designated Amgen strain #3728, 

The nucleotide and amino acid sequences (SEQ ID NOS: 5 and 6) of 
the fusion protein are shown in Figure 7. 
5 Fc-TMP-TMP , A DNA sequence coding for the Fc region of human 

IgGl fused in-frame to a dimer of the TPO-mimetic peptide was 
constructed using standard PCR technology. Templates for PCR reactions 
were the pFc~A3 vector and a synthetic TMP-TMP gene. The synthetic 
gene was constructed from the 4 overlapping oligonucleotides (SEQ ID 
1 0 NOS: 371 to 374, respectively) shown below: 

1830-52 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG 

ACT CTG CGT CAG TGG CTG GCT GCT CGT GCT 

15 1830-53 ACC TCC ACC ACC AGC ACG AGC AGC CAG 

CCA CTG ACG CAG AGT CGG ACC 

1830-54 GGT GGT GGA GGT GGC GGC GGA GGT ATT GAG GGC CCA ACC 

CTT CGC CAA TGG CTT GCA GCA CGC GCA 

20 

1830-55 AAA AAA AGG ATC CTC GAG ATT ATG CGC GTG CTG CAA GCC 

ATT GGC GAA GGG TTG GGC CCT CAA TAC CTC CGC CGC C 

The 4 oligonucleotides were annealed to form the duplex encoding an 

2 5 amino acid sequence (SEQ ID NOS: 375 and 376, respectively) shown 

below: 

AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 

^ + + + + + + 60 

3Q CCAGGCTGAGACGCAGTCACCGACCGACGAGCACGA 
a RGGGGGIEGPTLRQWLAARA 

GGTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCA 

3 5 CCACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGT 

a GGGGGGGGIEGPTLRQWLAA 
CGCGCA 

121 148 

4 0 GCGCGTATTAGAGCTCCTAGGAAAAAAA 

a R A * - 

This duplex was amplified in a PCR reaction using 1830-52 and 1830-55 as 
45 the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 1216-52 and 1830-51 as described above for 

^7 
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Fc-TMP. The full length fusion gene was obtained from a third PCR 
reaction using the outside primers 1216-52 and 1830-55. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and Bam HI, and then ligated 
5 into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described in example 1. Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3727. 

1 o The nucleotide and amino acid sequences (SEQ ID NOS: 7 and 8) of 

the fusion protein are shown in Figure 8. 

TMP-TMP-Fc. A DN A sequence coding for a tandem repeat of the 
TPOmimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 

1 5 were the EMP-Fc plasmid from strain #3688 (see Example 3) and a 
synthetic gene encoding the TMP dimer. The synthetic gene for the 
tandem repeat was constructed from the 7 overlapping oligonucleotides 
shown below (SEQ ID NOS: 377 to 383, respectively): 

20 1885-52 TTT TTT CAT ATG ATC GAA GGT CCG ACT CTG CGT CAG TGG 

1885-53 AGC ACG AGC AGC CAG CCA CTG ACG CAG AGT CGG ACC TTC 

GAT CAT ATG 

25 1885-54 CTG GCT GCT CGT GCT GGT GGA GGC GGT GGG GAC AAA ACT 

CAC AC A 



30 



1885-55 CTG GCT GCT CGT GCT GGC GGT GGT GGC GGA GGG GGT GGC 

ATT GAG GGC CCA 

1885-56 AAG CCA TTG GCG AAG GGT TGG GCC CTC AAT GCC ACC CCC 

TCC GCC ACC ACC GCC 



1885-57 ACC CTT CGC CAA TGG CTT GCA GCA CGC GCA GGG GGA GGC 

35 ~ GGT GGG GAC AAA ACT 

1885-58 CCC ACC GCC TCC CCC TGC GCG TGC TGC 

... — 

These oligonucleotides were annealed to form the duplex shown encoding 
40 an amino acid sequence shown below (SEQ ID NOS 384 and 385): 
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10 



15 



20 



TTTTTTCATATGATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTGGCGGT 

1 + + + + + + 60 

GTATACTAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGACCGCCA 
a MIEGPTLRQWLAARAGG- 



a 



GGTGGCGGAGGGGGTGGCATTGAGGGCCCAACCCTTCGCCAATGGCTGGCTGCTCGTGCT 

+ + + + + + 

CCACCGCCTCCCCCACCGTAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTGCGCGT 

GGGGGGIEGPTLRQWLAA RA 



GGTGGAGGCGGTGGGGACAAAACTCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAA 

12 i .- + + + + - - - + + 180 

CCCCCTCCGCCACCC 

a QGGGGDKTLAARAGGGGGDK 

ACTCACACA 
181 189 



a T H T 

This duplex was amplified in a PCR reaction vising 1885-52 and 1885-58 as 
the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with DNA from the EMP-Fc fusion strain #3688 (see Example 3) using the 

2 5 primers 1885-54 and 1200-54. The full length fusion gene was obtained 

from a third PCR reaction using the outside primers 1885-52 and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamH I, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 

3 0 2596 cells as described for Fc-EMP herein. Clones were screened for the 

ability to produce the recombinant protein product and to possess the 
gene fusion having the correct nucleotide sequence. A single such clone 
was selected and designated Amgen strain #3798. 

The nucelotide and amino acid sequences (SEQ ID NOS: 9 and 10) 
35 of the fusion protein are shown in Figure 9. 

TMP-Fc . A DNA sequence coding for a monomer of the TPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
obtained fortuitously in the ligation in TMP-TMP-Fc, presumably due to 
the ability of primer 1885-54 to anneal to 1885-53 as well as to 1885-58. A 

4 0 single clone having the correct nucleotide sequence for the TMP-Fc 

construct was selected and designated Amgen strain #3788. 

1? 
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The nucleotide and amino acid sequences (SEQ ID NOS: 11 and 12) 
of the fusion protein are shown in Figure 10. - - 

Expression in E. coli . Cultures of each of the pAMG21-Fc-fusion 
constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 
5 containing 50 mg/ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the 
culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 

1 0 cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Retractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 
were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 

1 5 10% b-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 

pAMG21 . The expression plasmid pAMG21 can be derived from 
the Amgen expression vector pCFM1656 (ATCC #69576) which in turn be 

2 0 derived from the Amgen expression vector system described in US Patent 
No. 4,710,473. The pCFM1656 plasmid can be derived from the described 
pCFM836 plasmid (Patent No. 4,710,473) by: 

(a) destroying the two endogenous Ndel restriction sites by end 
filling with T4 polymerase enzyme followed by blunt end 

2 5 ligation; 

(b) replacing the DNA sequence between the unique Aatll and Clal 
restriction sites containing the synthetic Pl proriidter with a 

similar fragment obtained from pCFM636 (patent No. 4,710,473) 
containing the PL promoter (see SEQ ID NO: 386 below); and 

in 
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(c) substituting the small DNA sequence between the unique Clal 
and Kpnl re striction sites with the oligonucleotide having the 
sequence of SEQ ID NO: 388. 
SEQ ID NO: 386: 

AatXI 

fT CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT - 

3 ' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTTAAGTATA - 

- AAAAAAC ATAC AGAT AACC ATC TGC GGTGATAAATTATCTC TGGC GGTGTTGACAT AAA - 

- TTTTTTGTATGTCTATTGGTAGACGCCACTATTTAATAGAGACCGCCACAACTGTATTT - 

- TACCACTGGCGGTGATACTGAGCACAT 3 ' 

- ATGGTGACCGCCACTATGACTCGTGTAGC 5 ' 

Oai 

SEQ ID NO: 387: 

5 ' CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC 3 ' 
3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5 ' 

Clal 

The expression plasmid pAMG21 can then be derived from pCFM1656 by 
making a series of site-directed base changes by PCR overlapping oligo 
mutagenesis and DNA sequence substitutions. Starting with the BgUI site 
(plasmid bp # 180) immediately 5' to the plasmid replication promoter 
p copB and proceeding toward the plasmid replication genes, the base pair 
changes are as shown in Table B below. 



10! 
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Table B— Base pair changes resulting in pAMG21 



P AMG21 bo # bpinpCFM1656 



5 # 204 

# 428 

# 509 

# 617 

# 679 
10 # 980 

# 994 
#1004 
#1007 
#1028 

15 #1047 

#1178 

#1466 

#2028 

#2187 
20 #2480 

# 2499-2502 



T/A 
A/T 
G/C 

G/C 
T/A 
G/C 
A/T 
C/G 
A/T 
C/G 
G/C 
G/C 
G/C 
C/G 
A/T 

AGTG 
TCAC 



hp changed to in dAMG21 

C/G 
G/C 
A/T 

insert two G/C bp 

T/A 

C/G 

A/T 

C/G 

T/A 

T/A 

T/A 

T/A 

T/A 
bp deletion 

T/A 

T/A 

GTCA 
CAGT 



25 # 2642 



TCCGAGC 
AGGCTCG 



7 bp deletion 



30 



#3435 
#3446 
#3643 



G/C 
G/C 
A/T 



A/T 
A/T 
T/A 



/0> 
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The DNA sequence between the unique Aa£ll (position #4364 in 
pCFM1656) and SacH (position #4585 in pCFM1656) restriction sites is 
substituted with the DNA sequence (SEQ ID NO: 23) shown in Figures 
17 A and 17B. During the ligation of the sticky ends of this substitution 
5 DNA sequence, the outside Aaffl and SacH sites are destroyed. There are 
unique Aaffl and SacH sites in the substituted DNA. 

CiM221 (Amgen #2596 ). The Amgen host strain #2596 is an Rcoli K- 
12 strain derived from Amgen strain #393. It has been modified to contain 
both the temperature sensitive lambda repressor cI857s7 in the early ebg 
1 0 region and the lad 0 repressor in the late ebg region (68 minutes). The 
presence of these two repressor genes allows the use of this host with a 
variety of expression systems, however both of these repressors are 
irrelevant to the expression from luxP R . The untransformed host has no 

antibiotic resistances. 

15 The ribosome binding site of the cI857s7 gene has been modified to 

include an enhanced RBS. It has been inserted into the ebg operon 
between nucleotide position 1170 and 1411 as numbered in Genbank 
accession number M6444lGb_Ba with deletion of the intervening ebg 
sequence. The sequence of the insert is shown below with lower case 

2 0 letters representing the ebg sequences flanking the insert shown below 
(SEQ ID NO: 388): 

35 The construct was delivered to the chromosome using a 

recombinant phage called MMebg-cI857s7enhanced RBS #4 into F'tet/393. 
After recombination and resolution only the chromosomal insert described 

1 01 
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above remains in the cell. It was renamed F'tet/GMIOI. F'tet/GMlOl was 
then modified by the delivery of a lacl° construct into the ebg operon 
between nucleotide position 2493 and 2937 as numbered in the Genbank 
accession number M64441GbJBa with the deletion of the intervening ebg 
5 sequence. The sequence of the insert is shown below with the lower case 
letters representing the ebg sequences flanking the insert (SEQ ID NO: 
389) shown below: 

ggcggaaaccGACGTCCATCGAATGGTGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCGGAAGAGAGTCA 
ATTCAGGGTGGTGAATGTGAAACCAGTAACGTTATACGATGTCGCAGAGTATGCCGGTGTCTCTTATCAGACC 

1 0 GTTTCCCGCGTGGTGAACCAGGCCAGCCACGTTTCTGCGAAAACGCGGGAAAAAGTCGAAGCGGCGATGGCGG 
AGCTGAATTAC ATTCCCAACCGCGTGGCACAACAAC TGGCGGGC AAAC AGTCGCTCCTGATTGGC GTTGCC AC 
CTCCAGTCTGGCCCTGCACGCGCCGTCGCAAATTGTCGCGGCGATTAAATCTCGCGCCGATCAACTGGGTGCC 
AGCGTGGTGGTGTCGATGGTAGAACGAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCTTCTCGCGC 
AAC GCGTCAGTGGGCTGATCATTAACTATCC GCTGGATGACCAGGATGCCATTGCTGTGGAAGCTGCCTGCAC 

1 5 TAATGTTCCGGCGTTATTTCTTGATGTCTCTGACCAGACACCCAT^ 

GGTACGCGACTGGGCGTGGAGCATCTGGTCGCATTGGGTCACCAGCAAATCGCGCTGTTAGCGGGCCCATTAA 
GTTCTGTCTCGGCGCGTCTGCGTCTGGCTGGCTGGCATAAATATCTCACTCGCAATCAAATTCAGCCGATAGC 
GGAACGGGAAGGCGACTGGAGTGCCATGTCCGGTTTTCAACAAACCATGCAAATGCTGAATGAGGGCATCGTT 
CCCACTGCGATGCTGGTTGCCAACGATCAGATGGCGCTGGGCGCAATGCGCGCCATTACCGAGTCCGGGCTGC 

20 GCGTTGGTGCGGATATCTCGGTAGTGGGATACGACGATACCGAAGACAGCTCATGTTATATCCCGCCGTTAAC 
CACCATCAAAGAGGATTTTCGCCTGCTGGGGCAAACCAGCGT 

GCGGTGAAGGGCTiATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCTGGCGCCCAATACGCA^ 

CCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGT^ 

GTAAGGTACCATAGGATCCaggcacagga 

25 

The construct was delivered to the chromosome using a 
recombinant phage called AGebg-LacIQ#5 into Ftet/GMIOI. After 
recombination and resolution only the chromosomal insert described 
above remains in the cell. It was renamed F'tet/GM221. The F'tet episome 
3 0 was cured from the strain using acridine orange at a concentration of 25 
^ig/ml in LB. The cured strain was identified as tetracyline sensitive and 
was stored as GM221. 



Expression . Cultures of pAMG21-Fc-TMP-TMP in £. coli GM221 in 

3 5 Luria Broth medium containing 50 ng/ml kanamycin were incubated at 

37°C prior to induction. Induction of Fc-TMP-TMP gene product 

expression from the luxPR promoter was achieved foUowing the addition 

of the synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to 

the culture media to a final concentration of 20 ng/ml and cultures were 

40 incubated at 37°C for a further 3 hours. After 3 hours, the bacterial 

hH 
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cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-TMP-TMP 
was most likely produced in the insoluble fraction in £. coli. Cell pellets 
5 were lysed directly by resuspension in Laemmli sample buffer containing 
10% •-mercaptoethanol and were analyzed by SDS-PAGE. An intense 
Coomassie stained band of approximately 30kDa was observed on an 
SDS-PAGE gel. The expected gene product would be 269 amino acids in 
length and have an expected molecular weight of about 29.5 kDa. 

1 o Fermentation was also carried out under standard batch conditions at the 
10 L scale, resulting in similar expression levels of the Fc-TMP-TMP to 
those obtained at bench scale. 

Purification of Fc-TMP-TMP . Cells are broken in water (1/10) by 
high pressure homogenization (2 passes at 14,000 PSI) and inclusion 

1 5 bodies are harvested by centrifugation (4200 RPM in J-6B for 1 hour). 
Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM DTT, 
pH 8.7 for 1 hour at a 1/10 ratio. The solubilized mixture is diluted 20 
times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 

20 10 fold by ultafiltration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this rruxture is then adjusted to pH 5 with acetic 
acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5(10mg/ml protein load, room temperature). 

25 The protein is eluted off using a 20 column volume gradient in the same 
buffer ranging from lOOmM NaCl to 500mM NaCl. The pool from the 
column is diluted 3 fold and loaded onto a SP-Sepharose HP column in 20 
mM NaAc, 150 mM NaCl, pH 5(10 mg/ml protein load, room 
temperature). The protein is eluted off using a 20 column volume gradient 

In 
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in the same buffer ranging from 150 mM NaCl to 400 mM NaCl. The peak 

is pooled and filtered. 

Characterization of Fc-TMP activity . The following is a summary of 

in vivo data in mice with various compounds of this invention. . 
5 Mice: Normal female BDF1 approximately 10-12 weeks of age. 

Bleed schedule: Ten mice per group treated on day 0, two groups 

started 4 days apart for a total of 20 mice per group. Five mice bled at each 

time point, mice were bled a minimum of three times a week. Mice were 

anesthetized with isoflurane and a total volume of 140-160 pi of blood was 
1 0 obtained by puncture of the orbital sinus. Blood was counted on a 

Technicon HIE blood analyzer running software for murine blood. 

Parameters measured were white blood cells, red blood cells, hematocrit, 

hemoglobin, platelets, neutrophils. 

Treatments: Mice were either injected subcutaneously for a bolus 
1 5 treatment or implanted with 7-day micro-osmotic pumps for continuous 

delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 

Osmotic pumps were inserted into a subcutaneous incision made in the 

skin between the scapulae of anesthetized mice. Compounds were diluted 

in PBS with 0.1% BSA. All experiments included one control group, 
2 0 labeled "carrier" that were treated with this diluent only. The 

concentration of the test articles in the pumps was adjusted so that the 

calibrated flow rate from the pumps gave the treatment levels indicated in 

the graphs. 

Compounds: A dose titration of the compound was delivered to 
2 5 mice in 7 day micro-osmotic pumps. Mice were treated with various 
compounds at a single dose of 100 ng/kg in 7 day osmotic pumps. Some 
of the same compounds were then given to mice as a siriglebolus injection. 

Activity test results: The results of the activity experiments are 
shown in Figures 11 and 12. In dose response assays using 7-day micro- 
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osmotic pumps, the maximum effect was seen with the compound of SEQ 
ID NO: 18 was at 100 ug/kg/day; the 10 ug/kg/day dose was about 50% 
maximally active and 1 ug/kg/day was the lowest dose at which activity 
could be seen in this assay system. The compound at 10 ug/kg/day dose 
was about equally active as 100 ug/kg/day unpegylated rHu-MGDF in 
the same experiment. 



1 
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Example 3 

FoEMP fusions _ _ 

Fc-EMP . A DNA sequence coding for the Fc region of human IgGl 
fused in-frame to a monomer of the EPO-mimetic peptide was constructed 
5 using standard PCR technology. Templates for PCR reactions were a 
vector containing the Fc sequence (pFc-A3, described in International 
application WO 97/23614, published July 3, 1997) and a synthetic gene 
encoding EPO monomer. The synthetic gene for the monomer was 
constructed from the 4 overlapping oligonucleotides (SEQ ID NOS: 390 to 
1 0 393, respectively) shown below: 

1798-2 TAT GAA AGG TGG AGG TGG TGG TGG AGG TAC TTA CTC TTG 
CCA CTT CGG CCC GCT GAC TTG G 

15 1798-3 CGG TTT GCA AAC CCA AGT CAG CGG GCC GAA GTG GCA AGA 

GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT CAT 



20 



1798-4 GTT TGC AAA CCG CAG GGT GGC GGC GGC GGC GGC GGT GGT 
ACC TAT TCC TGT CAT TTT 

1798-5 CCA GGT CAG CGG GCC AAA ATG ACA GGA ATA GGT ACC ACC 
GCC GCC GCC GCC GCC ACC CTG 



The 4 oligonucleotides were annealed to form the duplex encoding an 

2 5 amino acid sequence (SEQ ID NOS: 394 and 395, respectively) shown 

below: 

TATGAAAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTG 
1 + + + + + + 60 

3 0 TACTTTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAAC 

b MKGGGGGGGTYSCHFGPLTW 

GGTTTGC AAAC CGC AGGGTGGC GGC GGC GGC GGCGGTGGT ACC TATTC CTGTC ATTTT 

£1 + + + + + + + -- 133 

3 5 CCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACC 
b VCKPQGGGGGGGGTYSCHF 

This duplex was amplified in a PCR reaction using 

40 1798-18 GCA GAA GAG CCT CTC CCT GTC TCC GGG TAA 

AGG TGG AGG TGG TGG TGG AGG TAC TTA 
CTC T 



45 



and 



1798-19 CTA ATT GGA TCC ACG AGA TTA ACC ACC 

CTG CGG TTT GCA A 
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as the sense and antisense primers (SEQ ID NOS: 396 and 397, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 

5 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

' 1798-17 AGA GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT ACC CGG 

AGA CAG GGA GAG GCT CTT CTG C 

10 

which are SEQ ID NOS: 398 and 399, respectively. The oligonucleotides 
1798-17 and 1798-18 contain an overlap of 61 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
above PCR products in a third reaction using the outside primers, 1216-52 

15 and 1798-19. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamHI, and then ligated 
into the vector pAMG21 (described below), also digested with Xbal and 
BamH I. Ligated DNA was transformed into competent host cells of E. coli 

2 0 strain 2596 (GM221, described herein). Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3718. 

The nucleotide and amino acid sequence of the resulting fusion 

2 5 protein (SEQ ID NOS: 15 and 16) are shown in Figure 13. 

EMP-Fc . A DNA sequence coding for a monomer of the EPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 
were the pFC-A3a vector and a synthetic gene encoding EPO monomer. 

3 0 The synthetic gene for the monomer was constructed from the 4 

overlapping oligonucleotides 17984 and 1798-5 (above) and 1798-6 and 
1798-7 (SEQ ID NOS: 400 and 401, respectively) shown below: 

)0(j 
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1798-6 GGC CCG CTG ACC TGG GTA TGT AAG CCA CAA GGG GGT GGG 
GGA GGC GGG GGG TAA TCT CGA G 

5 1798-7 GAT CCT CGA GAT TAG CCC CCG CCT CCC CCA CCC CCT TGT 
GGC TTA CAT AC 

The 4 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 402 and 403, respectively) shown 
1 0 below: 



GTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGC 

1 + + + + + + 60 

GTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCG 
15 A VCKPQGGGGGGGGTYSCHFG 

CCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGTAATCTCGAG 

61 + + + + + + " 122 

GGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCATTAGAGCTCCTAG 
20 A PLTWVCKPQGGGGGGG* 

This duplex was amplified in a PCR reaction using 



1798-21 TTA TTT CAT ATG AAA GGT GGT AAC TAT TCC TGT CAT TTT 

25 

and 



1798-22 TGG ACA TGT GTG AGT TTT GTC CCC CCC GCC TCC CCC ACC 

CCC T 

30 

as the sense and antisense primers (SEQ ID NOS: 404 and 405, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 

35 

1798-23 AGG GGG TGG GGG AGG CGG GGG GGA CAA AAC TCA CAC ATG 

TCC A 

40 1200-54 GTT ATT GCT CAG CGG TGG CA 

which are SEQ ID NOS: 406 and 407, respectively. The oligonucleotides 
1798-22 and 1798-23 contain an overlap of 43 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
4 5 above PCR products in a third reaction using the outside primers, 1787-21 
and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamHI, and then ligated 

//0 
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into the vector p AMG21 and transformed into competent E. coli strain 
2596 cells as described above. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
5 and designated Amgen strain #3688. 

The nucleotide and amino acid sequences (SEQ ID NOS: 17 and 18) 
of the resulting fusion protein are shown in Figure 14. 

EMP-EMP-Fc . A DNA sequence coding for a dimer of the EPO 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
1 0 constructed using standard PCR technology. Templates for PCR reactions 
were the EMP-Fc plasmid from strain #3688 above and a synthetic gene 
encoding the EPO dimer. The synthetic gene for the dimer was 
constructed from the 8 overlapping oligonucleotides (SEQ ID NOS:408 to 
415, respectively) shown below: 

15 

1869-23 TTT TTT ATC GAT TTG ATT CTA GAT TTG AGT TTT AAC TTT 

TAG AAG GAG GAA TAA AAT ATG 

1869-48 TAA AAG TTA AAA CTC AAA TCT AGA ATC AAA TCG ATA AAA 

20 AA 

1871-72 GGA GGT ACT TAC TCT TGC CAC TTC GGC CCG CTG ACT TGG 

GTT TGC AAA CCG 

25 1871-73 AGT CAG CGG GCC GAA GTG GCA AGA GTA AGT ACC TCC CAT 

ATT TTA TTC CTC CTT C 

1871-74 CAG GGT GGC GGC GGC GGC GGC GGT GGT ACC TAT TCC TGT 

CAT TTT GGC CCG CTG ACC TGG 

30 

1871-75 AAA ATG ACA GGA ATA GGT ACC ACC GCC GCC GCC GCC GCC 

ACC CTG CGG TTT GCA AAC CCA 

1871-78 GTA TGT AAG CCA CAA GGG GGT GGG GGA GGC GGG GGG GAC 

35 AAA ACT CAC ACA TGT CCA 

1871-79 AGT TTT GTC CCC CCC GCC TCC CCC ACC CCC TTG TGG CTT 

ACA TAC CCA GGT CAG CGG GCC 

40 The 8 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 416 and 417, respectively) shown 
below: 

TTTTTTATCGATTTGATTCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATG 

. j- - . + __.._---- + -- «- -- -- - + -- -- + ----+• - -•-----+ 60 

AAA^TAGCTAAACTAAGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATAC 

M 

i 
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GGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTTGCAAACCGCAGGGTGGC 

61 + + + + + + 120 

CCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAACGTTTGGCGTCCCACCG 
5 a GGTYSCHFGPLTWV C KPQG G 

GGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAG 

121 + + + + + + 180 

CCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTC 
10 a GGGGGGTYSCHFGPLTWVCK 

CCACAAGGGGGTGGGGGAGGCGGGGGGGACAAAACTCACACATGTCCA 

131 ----- .+.--»------ + -- -- -- -- - + -- -- •- -- 228 

GGTGTTCCCCCACCCCCTCCGCCCCCCCTGTTTTGA 
15 a pQGGGGGGGDKTHTCP 

This duplex was amplified in a PCR reaction using 1869-23 and 
1871-79 (shown above) as the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
2 0 with strain 3688 DNA using the primers 1798-23 and 1200-54 (shown 
above). 

The oligonucleotides 1871-79 and 1798-23 contain an overlap of 31 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 

2 5 using the outside primers, 1869-23 and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamHL and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for ability to 

3 0 produce the recombinant protein product and possession of the gene 

fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3813. 

The nucleotide and amino acid sequences (SEQ ED NOS: 19 and 20, 
respectively) of the resulting fusion protein are shown in Figure 15. There 
35 is a silent mutation at position 145 (A to G, shown in boldface) such that 
the final construct has a different nucleotide sequence than the 
oligonucleotide 1871-72 from which it was derived. 

Fc-EMP-EMP . A DNA sequence coding for the Fc region of human 
IgGl fused in-frame to a dimer of the EPOmimetic peptide was 
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constructed using standard PCR technology. Templates for PCR reactions 
were the plasmids from strains 3688 and 3813 above. 

The Fc portion of the molecule was generated in a PCR reaction 
with strain 3688 DNA using the primers 1216-52 and 1798-17 (shown 
5 above). The EMP dimer portion of the molecule was the product of a 

second PCR reaction with strain 3813 DNA using the primers 1798-18 (also 
shown above) and SEQ ID NO: 418, shown below: 

1798-20 CTA ATT GGA TCC TCG AGA TTA ACC CCC TTG TGG CTT ACAT 

10 

The oligonucleotides 1798-17 and 1798-18 contain an overlap of 61 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1798-20, 
1 5 The final PCR gene product (the full length fusion gene) was 

digested with restriction endonucleases Xba l and BamHI, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
2 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #3822. 

The nucleotide and amino acid sequences (SEQ ID NOS: and , 

respectively) of the fusion protein are shown in Figure 16. 

Characterization of Fc-EMP activity . Characterization was carried 

2 5 out in vivo as follows. 

Mice: Normal female BDF1 approximately 10-12 weeks of age. 

Bleed schedule: Ten mice per group treated on day 0, two groups 
started 4 days apart for a total of 20 mice per group. Five mice bled at 
each time point, mice were bled a maximum of three times a week. Mice 

3 0 were anesthetized with isoflurane and a total volume of 140-160 ml of 

blood was obtained by puncture of the orbital sinus. Blood was counted 



WO 00/24782 PCT/US99/25044 

on a Technicon HIE blood analyzer running software for murine blood. 
Parameters measured were WBC, RBC, HCT, HGB, PLT, NEUT, LYMPH. 

Treatments: Mice were either injected subcutaneously for a bolus 
treatment or implanted with 7 day micro-osmotic pumps for continuous 
5 delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 
Osmotic pumps were inserted into a subcutaneous incision made in the 
skin between the scapulae of anesthetized mice. Compounds were diluted 
in PBS with 0.1% BSA. All experiments included one control group, 
labeled "carrier" that were treated with this diluent only. The 
1 0 concentration of the test articles in the pumps was adjusted so that the 

calibrated flow rate from the pumps gave the treatment levels indicated in 
the graphs. 

Experiments: Various Fc-conjugated EPO mimetic peptides (EMPs) 
were delivered to mice as a single bolus injection at a dose of 100 ng/kg. 
1 5 Fc-EMPs were delivered to mice in 7-day micro-osmotic pumps. The 

* 

pumps were not replaced at the end of 7 days. Mice were bled until day 
51 when HGB and HCT returned to baseline levels. 

Example 4 
TNF-ot inhibitors 

2 0 Fc-TNF-a inhibitors . A DNA sequence coding for the Fc region of 

human IgGl fused in-frame to a monomer of the TNF-a inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 
from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 

2 5 primer 1216-52 and the antisense primer 2295-89 (SEQ ID NOS: 1112 and 
1113 , respectively). The nucleotides encoding the TNF-a inhibitory 
peptide were provided by the PCR primer 2295-89 shown Below: 



30 



1216-52 
2295-89 



AAC ATA AGT ACC TGT AGG ATC G 

CCG CGG ATC CAT TAC GGA CGG TGA CCC AGA GAG GTG TTT TTG TAG 



WO 00/24782 



PCT/US99/25044 



TGC GGC AGG AAG TCA CCA CCA CCT CCA CCT TTA CCC 

The oligonucleotide 2295-89 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
5 being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 

1 o produce the recombinant protein product and to possess the gene fusion 

having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4544. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1055 and 
1056) of the fusion protein are shown in Figures 19A and 19B. 
1 5 TNF-a inhibitor-Fc . A DNA sequence coding for a TNF-a inhibitory 

peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The template for the PCR reaction was a 
plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the TNF-a inhibitory peptide were 

2 0 provided by the sense PCR primer 2295-88, with primer 1200-54 serving as 

the antisense primer (SEQ ID NOS: 1117 and 407, respectively). The 
primer sequences are shown below: 

2295-88 GAA TAA CAT ATG GAC TTC CTG CCG CAC TAC AAA AAC ACC TCT CTG GGT 

25 CAC CGT CCG GGT GGA GGC GGT GGG GAC AAA ACT 

1200-54 GTT ATT GCT CAG CGG TGG CA 

30 

The oligonucleotide 2295-88 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 
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The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Nde l and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
5 produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4543. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1057 and 1058) of 
the fusion protein are shown in Figures 20A and 20B. 

1 0 Expression in E. coli . Cultures of each of the p AMG21-Fc-fusion 

constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 
containing 50 mg/ ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the 

1 5 culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 
cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 

2 0 were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 
10% p-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 

25 Purification of Fc-peptide fusion proteins . Cells are broken in water 

(1/10) by high pressure homogenization (2 passes at 14,000 PSI) and 
inclusion bodies are harvested by centrifugation (4200 RPM in J-6B for 1 
hour). Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM 
DTT, pH 8.7 for 1 hour at a 1/10 ratio. The solubilized mixture is diluted 
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20 times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 
10 fold by ultafiltration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic 

■ 

5 acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5 (lOmg/ml protein load, room temperature). 
The protein is eluted from the column using a 20 column volume gradient 
in the same buffer ranging from lOOmM NaCl to 500mM NaCL The pool 

1 0 from the column is diluted 3 fold and loaded onto a SP-Sepharose HP 

column in 20mM NaAc, 150mM NaCl, pH 5(10mg/ml protein load, room 
temperature). The protein is eluted using a 20 column volume gradient in 
the same buffer ranging from 150mM NaCl to 400mM NaCl. The peak is 
pooled and filtered. 

15 Characterization of activity of Fc-TNF-cc inhibitor and TNF-a 

inhibitor -Fc . Binding of these peptide fusion proteins to TNF- a can be 
characterized by BIAcore by methods available to one of ordinary skill in 
the art who is armed with the teachings of the present specification. 

Example 5 

20 IL-1 Antagonists 

Fc-IL-1 antagonist . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an IL-1 antagonist peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 

2 5 from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 

primer 1216-52 and the antisense primer 2269-70 (SEQ ID NOS: 1112 and 
1118, respectively). The nucleotides encoding the IL-lanlagonist peptide - 
were provided by the PCR primer 2269-70 shown below: 



II? 
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1216-52 AAC ATA AGT ACC TGT AGG ATC G 

2269-70 CCG CGG ATC CAT TAC AGC GGC AGA GCG TAC GGC TGC CAG TAA CCC 

GGG GTC CAT TCG AAA CCA CCA CCT CCA CCT TTA CCC 

5 * ~ 

The oligonucleotide 2269-70 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

1 o The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamHL and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
1 5 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4506. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1059 and 
1060) of the fusion protein are shown in Figures 21 A and 21B. 

IL-1 antagonist-Fc A DNA sequence coding for an IL-1 antagonist 

2 0 peptide fused in-frame to the Fc region of human IgGl was constructed 

using standard PCR technology. The template for the PCR reaction was a 
plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the IL-1 antagonist peptide were provided 
by the sense PCR primer 2269-69, with primer 1200-54 serving as the 
2 5 antisense primer (SEQ ID NOS: 1119 and 407, respectively). The primer 
sequences are shown below: 

2269-69 GAA TAA CAT ATG TTC GAA TGG ACC CCG GGT TAC TGG CAG CCG TAC GCT 

30 ~ CTG CCG CTG GGT GGA GGC GGT GGG GAC AAA ACT 

* 

1200-54 GTT ATT GCT CAG CGG TGG CA 



I// 
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The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 
5 The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 

1 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4505. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1061 and 
1062) of the fusion protein are shown in Figures 22A and 22B. Expression 
and purification were carried out as in previous examples. 

15 Characterization of Fc-IL-1 antaeonist peptide an d IL-1 antagonist 

peptide-Fc activity . IL-1 Receptor Binding competition between IL-lp, IL- 
1RA and Fc-conjugated IL-1 peptide sequences was carried out using the 
IGEN system. Reactions contained 0.4 nM biotin-IL-lR + 15 nM IL-l-TAG 
+ 3 uM competitor + 20 ug/ml streptavidin-conjugate beads, where 

2 0 competitors were IL-1RA, Fc-IL-1 antagonist, IL-1 antagonist-Fc). 

Competition was assayed over a range of competitor concentrations from 
3 uM to 1.5 pM. The results are shown in Table C below: 
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Table C— Results from IL-1 Receptor Binding Competition Assay 



IL~1pep-Fc Fc-IL-1pep tL-1ra 

5 Kl 281.5 59.58 1.405 

EC50 530.0 112.2 2.645 

95% Confidence Intervals 

10 EC50 280.2 to 1002 54.75 to 229.8 1.149 to 

6.086 



15 



Kl 148.9 to 532.5 29.08 to 122.1 0.61 06 to 

3.233 

Goodness off Fit 

R* 0.9790 0.9687 0.9602 
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Example 6 
VEGF- Antagonists 
FoVEGF Antagonist . A DNA sequence coding for the Fc region of 
5 human IgGl fused in-frame to a monomer of the VEGF mimetic peptide 
was constructed using standard PCR technology. The templates for the 
PCR reaction were the pFc-A3 plasmid and a synthetic VEGF mimetic 
peptide gene. The synthetic gene was assembled by annealing the 
following two oligonucleotides primer (SEQ ID NOS: 1120 and 1121, 
1 0 respectively): 

2293-11 GTT GAA CCG AAC TGT GAC ATC CAT GTT ATG TGG GAA TGG GAA 

TGT TTT GAA CGT CTG 

2293-12 CAG ACG TTC AAA ACA TTC CCA TTC CCA CAT AAC ATG GAT GTC 

15 ACA GTT CGG TTC AAC 

The two oligonucleotides anneal to form the following duplex encoding 
an amino acid sequence shown below (SEQ ID NOS 1122 ): 

20 

GTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTG 

1 + -- - + + + + 

CAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCAGAC 

25 a VE PNCD I HVMWEWE C F ERL 

This duplex was amplified in a PCR reaction using 2293-05 and 2293-06 as 
the sense and antisense primers (SEQ ID NOS. 1125 and 1126). 
3 0 The Fc portion of the molecule was generated in a PCR reaction 

with the pFc-A3 plasmid using the primers 2293-03 and 2293-04 as the 
sense and antisense primers (SEQ ID NOS. 1123 and 1124rrespectively). . 
The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-03 and 2293-06. These primers are shown below: 
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2293-03 ATT- TGA TTC TAG AAG GAG GAA TAA CAT ATG GAC AAA ACT CAC 

ACA TGT 

5 2293-04 GTC ACA GTT CGG TTC AAC ACC ACC ACC ACC ACC TTT ACC CGG 

AGA CAG GGA 

2293-05 TCC CTG TCT CCG GGT AAA GGT GGT GGT GGT GGT GTT GAA CCG 

AAC TGT GAC ATC 



10 



2293-06 CCG CGG ATC CTC GAG TTA CAG ACG TTC AAA ACA TTC CCA 



The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Nde l and BamHI, and then ligated into the 
1 5 vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4523. 

2 0 The nucleotide and amino acid sequences (SEQ ID NOS: 1063 and 

1064) of the fusion protein are shown in Figures 23A and 23B. 

VEGF antagonist -Fc . A DNA sequence coding for a VEGF mimetic 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The templates for the PCR reaction were 
25 the pFc-A3 plasmid and the synthetic VEGF mimetic peptide gene 

described above. The synthetic duplex was amplified in a PCR reaction 
using 2293-07 and 2293-08 as the sense and antisense primers (SEQ ID 
NOS. 1127 and 1128, respectively). 

The Fc portion of the molecule was generated in a PCR reaction 

3 0 with the pFc-A3 plasmid using the primers 2293-09 and 2293-10 as the 

sense and antisense primers (SEQ ID NOS. 1129 and 1130, respectively). 
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The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-07 and 2293-10. These primers are shown below: 

2293-07 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GTT GAA CCG AAC 

5 TGT GAC 

2293-08 AC A TGT GTG AGT TTT GTC ACC ACC ACC ACC ACC CAG ACG TTC 

AAA ACA TTC 

10 2293-09 GAA TGT TTT GAA CGT CTG GGT GGT GGT GGT GGT GAC AAA ACT 

CAC ACA TGT 

2293-10 CCG CGG ATC CTC GAG TTA TTT ACC CGG AGA CAG GGA GAG 

The PCR gene product (the full length fusion gene) was digested 
1 5 with restriction endonucleases Nde l and BamHL and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

2 0 and designated Amgen strain #4524. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1065 and 
1066) of the fusion protein are shown in Figures 24A and 24B. Expression 
and purification were carried out as in previous examples. 

25 Example 7 

MMP Inhibitors 

Fc-MMP inhibitor . A DN A sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an MMP inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 

3 0 linker portion of the molecule was generated in a PCR reaction with DNA 

from the Fc-TNF-a inhibitor fusion strain #4544 (see Example 4) using the 
sense primer 1216-52 and the antisense primer 2308-67 (SEQ ID NOS: 1112 

U3 
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and 1131, respectively). The nucleotides encoding the MMP inhibitor 
peptide were provided by the PGR primer 2308-67 shown below: 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

5 

2308-67 CCG CGG ATC CAT TAG CAC AGG GTG AAA CCC CAG TGG GTG GTG 

CAA CCA CCA CCT CCA CCT TTA CCC 

The oligonucleotide 2308-67 overlaps the glycine linker and Fc portion of 

1 0 the template by 22 nucleotides, with the PGR resulting in the two genes 
being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 

1 5 described for EMP-Fc herein. Clones were screened for the ability to 

produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4597. 

The nucleotide and amino add sequences (SEQ ID NOS: 1067 and 

2 0 1068) of the fusion protein are shown in Figures 25A and 25B. Expression 
and purification were carried out as in previous examples. 

MMP Inhibitor-Fc . A DNA sequence coding for an MMP inhibitory 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The Fc and 5 glycine linker portion of the 

2 5 molecule was generated in a PCR reaction with DNA from the Fc-TNF-a 
inhibitor fusion strain #4543 (see Example 4). The nucleotides encoding 
the MMP inhibitory peptide were provided by the sense PCR primer 2308- 
66, with primer 1200-54 serving as the antisense primer (SEQ ID NOS: 
1132 and 407, respectively). The primer sequences are shown below: 

30 

2308-66 GAA TAA CAT ATG TGC ACC ACC CAC TGG GGT TTC ACC CTG TGC 

GGT GGA GGC GGT GGG GAC AAA 

35 1200-54 GTT ATT GCT CAG CGG TGG CA 

)3M 
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The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 
5 The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamM, and then ligated into the 
vector pAMG21 and transformed into competent E.coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
1 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4598. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1069 and 

1070) of the fusion protein are shown in Figures 26 A and 26B. 

* # * 

1 5 The invention now being fully described, it will be apparent to one 

of ordinary skill in the art that many changes and modifications can be 
made thereto, without departing from the spirit and scope of the invention 
as set forth herein. 



2 o Abbreviations 

Abbreviations used throughout this specification are as defined 
below, unless otherwise defined in specific circumstances. 

Ac acetyl (used to refer to acetylated residues) 

AcBpa acetylated p-benzoyl-L-phenylalanine 
2 5 ADCC antibody-dependent cellular cytotoxicity 

Aib aminoisobutyric acid 

bA beta-alanine 

Bpa p-benzoyl-L-phenylalanine 

BrAc bromoacetyl (BrCH 2 C(0) 
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BSA Bovine serum albumin 

Bzl Benzyl 

Cap Caproic acid 

CTL Cytotoxic T lymphocytes 

5 CTLA4 Cytotoxic T lymphocyte antigen 4 

DARC Duffy blood group antigen receptor 

DCC Dicylcohexylcarbodiimi.de 

Dde l-(4 / 4-dimethyl-2,6-dioxo-cyclohexylidene)ethyl 

EMP Erythropoietin-mimetic peptide 

1 0 ESI-MS Electron spray ionization mass spectrometry 

EPO Erythropoietin 

Fmoc fluorenylmethoxycarbonyl 

G-CSF Granulocyte colony stimulating factor 

GH Growth hormone 

15 HCT hematocrit 

HGB hemoglobin 

hGH Human growth hormone 

HOBt 1-Hydroxybenzotriazole 

HPLC high performance liquid chromatography 

20 IL interleukin 

IL-R interleukin receptor 

IL-1R interleukin-1 receptor 

IL-lra interleukin-1 receptor antagonist 

Lau Laurie acid 

25 LPS lipopolysaccharide 

LYMPH lymphocytes 
- MALDI-MS Matrix-assisted laser desorption ionizatiortmass 

spectrometry 

Me methyl 

m 
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MeO methoxy 

MHC major histocompatibility complex 

MMP matrix metalloproteinase 

MMPI matrix metalloproteinase inhibitor 

5 1-Nap 1-napthylalanine 

NEUT neutrophils 

NGF nerve growth factor 

Nle norleucdne 

NMP N-methyl-2-pyrrolidinone 

1 0 PAGE polyacrylamide gel electrophoresis 

PBS Phosphate-buffered saline 

Pbf 2,2,4,6,7-pendamethyldihydrobenzofuran-S-sulfonyl 

PCR polymerase chain reaction 

Pec pipecolic acid 

15 PEG Poly (ethylene glycol) 

pGlu pyroglutamic acid 

Pic picolinic acid 

PUT platelets 

pY phosphotyrosine 

2 0 RBC red blood cells 

RBS ribosome binding site 

RT room temperature (25 °C) 

Sar sarcosine 

SDS sodium dodecyl sulfate 

2 5 STK serine-threonine kinases 

t-Boc tert-Butoxycarbonyl 

tBu tert-Butyl - — 

TGF tissue growth factor 

THF thymic humoral factor 

m 
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TK 


tyrosine kinase 


TMP 


Thrombopoietin-mimetic peptide 


TNF 


Tissue necrosis factor 


TPO 


Thrombopoietin 


TRAIL 


TNF-related apoptosis-inducing ligand 


Trt 


trityl 


UK 


urokinase 


UKR 


urokinase receptor 


VEGF 


vascular endothelial cell growth factor 


VIP 


vasoactive intestinal peptide 


WBC 


white blood cells 
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What is claimed is: 

1 . A composition of matter of the formula 

and multimers thereof, wherein: 
5 F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(LVP 1 , - 
(LVP'-a 2 ), -P 2 , -(LVP^d^-O-V^ and -(l\P x -(L\-P-CL\ -P 3 - 

avp 4 

P 1 , P 2 , P 3 , and P 4 are each independently sequences of 
1 0 pharmacologically active peptides; 

L\ L 2 , L 3 , and L 4 are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided 
that at least one of a and b is L 

2. The composition of matter of Claim 1 of the formulae 

is x l -r 



or 

•1 \j2 



3. The composition of matter of Claim 1 of the formula 

F , -(L , ) C -P 1 . 

20 4. The composition of matter of Claim 1 of the formula 

FMLVP'-fl-VP 2 . 

5. The composition of matter of Claim 1 wherein F 1 is an IgG Fc 
domain. 

6. The composition of matter of Claim 1 wherein F 1 is an IgGl Fc 
2 5 domain. 

7. The composition of matter of Claim 1 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

8. The composition of matter of Claim 1 wherein X 1 and X 2 comprise 
an IL-1 antagonist peptide sequence. 



The composition of matter of Claim 8 wherein the IL-1 antagonist 
peptide sequence is selected from SEQ ID NOS: 212, 907, 908, 909, 
910, 917, and 979. 

The composition of matter of Claim 8 wherein the IL-1 antagonist 
peptide sequence is selected from SEQ ID NOS: 213 to 271, 671 to 
906, 911 to 916, and 918 to 1023. 

The composition of matter of Claim 8 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

The composition of matter of Claim 1 wherein X 1 and X 2 comprise 
an EPO-mimetic peptide sequence. 

The composition of matter of Claim 12 wherein the EPO-mimetic 
peptide sequence is selected from Table 5. 
The composition of matter of Claim 12 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

The composition of matter of Claim 12 comprising a sequence 
selected from SEQ ID NOS: 83, 84, 85, 124, 419, 420, 421, and 461. . 
The composition of matter of claim 12 comprising a sequence 
selected from SEQ ID NOS: 339 and 340. 
The composition of matter of Claim 12 comprising a sequence 
selected from SEQ ID NOS: 20 and 22. 

The composition of matter of Claim 3 wherein P 1 is a TPO-mimetic 
peptide sequence. 

The composition of matter of Claim 18 wherein P 1 is a TPO-mimetic 
peptide sequence selected from Table 6. 

The composition of matter of Claim 18 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

The composition of matter of Claim 18 having a sequence selected 
from SEQ ID NOS: 6 and 12. 

A DNA encoding a composition of matter of any of Claims 1 to 21. 
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23. An expression vector comprising the DNA of Claim 22. 

24. A host cell comprising the expression vector of Claim 23. 

25. The cell of Claim 24, wherein the cell is an E. coli cell. 

26. A process for preparing a pharmacologically active compound, 
5 which comprises 

a) selecting at least one randomized peptide that modulates the 
activity of a protein of interest; and 

b) preparing a pharmacologic agent comprising at least one Fc 
domain covalently linked to at least one amino acid sequence 

10 of the selected peptide or peptides. 

27. The process of Claim 26, wherein the peptide is selected in a process 
comprising screening of a phage display library, an E. coli display 
library, a ribosomal library, or a chemical peptide library. 

28. The process of Claim 26, wherein the preparation of the 
1 5 pharmacologic agent is carried out by: 

a) preparing a gene construct comprising a nucleic acid 
sequence encoding the selected peptide and a nucleic acid 
sequence encoding an Fc domain; and 

b) expressing the gene construct. 

2 0 29. The process of Claim 26, wherein the gene construct is expressed in 

an E. coli cell. 

30. The process of Claim 26, wherein the protein of interest is a cell 
surface receptor. 

31. The process of Claim 26, wherein the protein of interest has a linear 
2 5 epitope. 

32. The process of Claim 26, wherein the protein of interest is a 

cytokine receptor. - — • 

33. The process of Claim 26, wherein the peptide is an EPO-mimetic 

peptide. 
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34. The process of Claim 26, wherein the peptide is a TPO-mimetic 
. peptide. 

35. The process of Claim 26, wherein the peptide is an IL-1 antagonist 
peptide. 

5 36. The process of Claim 26, wherein the peptide is an MMP inhibitor 
peptide or a VEGF antagonist peptide. 

37. The process of Claim 26, wherein the peptide is a TNF-antagonist 
peptide. 

38. The process of Claim 26, wherein the peptide is a CTLA4-mimetic 
1 0 peptide. 

39. The process of Claim 26, wherein the peptide is selected from 
Tables 4 to 20. 

40. The process of Claim 26, wherein the selection of the peptide is 
carried out by a process comprising: 

15 a) preparing a gene construct comprising a nucleic acid 

sequence encoding a first selected peptide and a nucleic acid 
sequence encoding an Fc domain; 
b) conducting a polymerase chain reaction using the gene 
construct and mutagenic primers, wherein 
20 i) a first mutagenic primer comprises a nucleic acid 

sequence complementary to a sequence at or near the 
5' end of a coding strand of the gene construct, and 
ii) a second mutagenic primer comprises a nucleic acid 
sequence complementary to the 3' end of the 
2 5 noncoding strand of the gene construct. 

41 . The process of Claim 26, wherein the compound is derivatized. 

42. The process of Claim 26, wherein the derivatized compoimd 
comprises a cyclic portion, a cross-linking site, a non-peptidyl 
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linkage, an N-terminal replacement, a C-terminal replacement, or a 
modified amino acid moiety. 
43. The process of Claim 26 wherein the Fc domain is an IgG Fc 
domain. 

5 44. The process of Claim 26, wherein the vehicle is an IgGl Fc domain. 

45. The process of Claim 26, wherein the vehicle comprises the 
sequence of SEQ ID NO: 2. 

46. The process of Claim 26, wherein the compound prepared is of the 
formula 

io (XVFMA 

and multimers thereof,wherein: 
F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(C) -P\ - 

(lvp'-oA, -p 2 , -aVP^LV^a-VP 3 , *nd -aA-p'-a 2 )^<c\ -p 3 - 

15 (LVP 4 

P\ P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 

L 1 , L 2 , L 3 , and L 4 are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided 
2 0 that at least one of a and b is 1 . 

47. The process of Claim 46, wherein the compound prepared is of the 
formulae 

X 1 -F 1 

or 

25 F'-X 2 . 

48. The process of Claim 46, wherein the compound prepared is of the 

■ 

formulae 

F-flA-P 1 
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or 

49. The process of Claim 46, wherein F 1 is an IgG Fc domain. 

50. The process of Claim 46, wherein F l is an IgGl Fc domain. 

51. The process of Claim 46, wherein F l comprises the sequence of SEQ 
ID NO: 2. 
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FIG. 1 

peptide selection 

i 

peptide optimization 

I 

formation of Fc-peptide DNA construct 

I 

insertion of construct into expression vector 

I 

transfection of host cell with vector 

I 

expression of vector in host cell 

i 

Fc multimer formation in host cell 

i 

isolation of Fc multimer from host cell 
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FIG. 2A 



FIG. 2B 
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FIG. 3A 
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FIG. 4 



ATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCA 
1 + .+.......__+ -+ 60 

TACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGT 

MDKTHTCPPCPAPELLGGPS 

GTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCGCTGAGGTC 
51 . + + + + + + 120 

CAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAG 
VFIiFPPKPKDTLMISRTPEV 

ACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTG 

121 + + + + + + 180 

TGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCAC 

TCVVVDVSHEDPEVKFNWYV 

GACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACG 

181 + + + + + + 240 

CTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGC 

DGVEVHNAKTKPREEQYNST 

TACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTAC 

241 + + + + + + 300 

ATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATG 

Y. RVVSVLTVLHQDWLNGKEY 

AAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCC 

301 + + + * + + 360 

TTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGG 

KCKVSNKALPAPIEKTISKA 

AAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACC 

361 + + + + + -*+ 420 

TTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGG 

KGQ PRE PQVYTL P P S RDE LT 

AAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTG 

421 + + + + + - -+ 480 

TTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCAC 

* 

KNQVS LTC LVKGFYP S D I AV 

GAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGAC 

481 + + + + + + 540 

CTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTG 

E W E S N G Q PENNYKTT P PVLD 

TCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAG 

54! + + + + + + 600 

AGGCTGC C GAGG AAG AAGGAG ATGTC GTTC GAGTGGCACCTGTTCTC GTCC ACC GTC GTC . 

SDGS FFLYSKLTVDKSRWQQ 

GGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAG 

601 * + + + * 660 

CCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTC 

GNVFSC SVMHEALHNHYTQK 

AGCCTCTCCCTGTCTCCGGGTAAA 

661 * + 684 

TCGGAGAGGGACAGAGGCCCATTT 
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FIG. 5 

NH-Ode 

fBu Jrt Pb» S |Bu Jrt Pbf 

Boc-IEGPTLRQWLAARA-GGG -HN CO-GGGG-I|GPTLRQWLAARA _ OCHj - ^^ 
IBu PbfBoc I tBu ptrfBoe 

2% H 2 NNH 2 /NMP| Wang resin 

tBu Jrt Pbf S |Bu Jrt Pbf 

Boc-IEGFTLRQWLAARA-GGG -HN CO-GGGG-I|GFTLRQWLAARA--OCHj-^^ 
tBu PbfBoc »U PbfBoc 

(BrCHjCOfcOj Wang resin 

jBu Jrt Pbf S |Bu Jrt Pbf 

Boc-IEGPTLRQWLAARA-GGG -HN CO-GGGG-IEGPTUIQWIAARA - OCHj-^^ 
tBu PbfBoc W PbfBoc 

IWang resin 

BrJL 

H-IEGPTLRQWLAARA-GGG-HN CO-GGGG-IEGPTLRQWLAARA-OH 

peptide 1 7b 

MaO-l PEGSOOOT -SH | p H8 
MeQ- 1 PEG 5000 |~ S N X rj|| - - 

H-IEGPTLRQWLAARA-GGG-HN^CC>GGGG-IEGPTlJlQWLAARA-OH 

peptide 19 
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FIG. 6 

tBu Trt Pbf Trt fBu Trt Pbf 

h-iegptlrqwlaara<;ggcgggg-i|gptlrqwlaara— 0CHr~0 
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O I 

MeO- i PEG 5000 h N^j | pH 8 



O 

MeO-l PEG SOCK) h N^J^ 

° 1 

H-IEGPTLRQWLAARA-GGG-HN^CO-GGGG-IEGPTLRQWLAARA-OH 

peptide 20 
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I 

TCTAGATTTCTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 
1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTCP- 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 
61 + + + + ♦ + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 
121 ♦ ♦ + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMI SRTPEVTCVVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 
181 + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HED PEV KFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 
241 + ♦ + ■»■ + ♦ 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ♦ + + + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLMQDWLN'GKEYKCXVS N K A - 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC . 

361 + + + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTC 

LPAPIEKTISKAK GQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + ♦ + + ♦ + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTtiPPSRDELTKNQVS LTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 + + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFY PSDIAVE WESNGQ P - 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 ♦ + ♦ ♦ + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTPPVLDSDGS F FLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + + 720 

ACT AC GT ACTCC GAGAC GTGTTGGTG ATGTGCGTC TTCTCGGAGAGGGAC AGAGGC CC AT 
MHEALHNHYTQKSLSLSPGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTT 

721 + + + + ♦ + 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAA 
GGGGGI EGPTLRQWLAA R A * - 

BamHX 
I 

AATCTCGAGGATCC 

781 + 794 

TTAGAGCTCCTAGG 
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FIG. 8 



Xbal 
I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 ♦ + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MD KTHTC P* 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + ♦ + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMI SRTPEVTC VVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + * + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCACTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 ♦ + + ♦ + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTC CTGCACCAGGACTGGCTGAATGGCAAGGAGTAC AAGTGC AAGGTCTCCAACAAAG 

301 + * + + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTC^ 

VLHQDWLNGREYKCKV9NKA* 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGC'AGCCCCGAGAACCAC 

361 * + + + * + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGC 

LPAPXEKTI SKAKGQPRE P Q * 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 ♦ ♦ ♦ ♦ + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPP3RDELTKNQV3LTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 -■ + ♦ + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGPY PSDIAVEWE S N G Q P • 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + ♦ + + + ♦ 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENN YKTTPPVLDSDGSPFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVF3CSV- 

TGATGCATGAGGCTCTGCACAACGACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + ♦ ♦ ♦ ♦ + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLS L9 PGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTG 

721 + + - - + + + 780 

TTCCACCTCC ACC ACCATAGCTTCC AGGCTGAGACGCAGTCACCGACCGACGAGC ACGAC 
GGGGGIBGPTLRQWLAARAG- 

GTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCAC 

781 + + ♦ + + ♦ 840 

CACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTG 
GGGGGGGI EG PTLRQWLAAR* 

BanHZ 
I 

GCGC AT AATCTC GAGGATCCG 

841 + + - 861 

CGCGTATTAGAGCTCCTAGGC 

c A * - 
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FIG. 9 



Xbal 
I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTCC 

1 ♦ + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGCGGTGGTGGCGGAGGGGGTGGCATTGAGGGCCCAA 

61 + + - • ♦ + + 120 

CAGTCACCGACCGACGAGCACGACCGCCACCACCGCCTCCCCCACCGTAACTCCCGGGTT 
QWLAARAGGGGCGGG I EG P T - 

CCCTTCGCCAATGGCTTGCAGCACGCGCAGGGGGAGGCGGTGGGGACAAAACTCACACAT 

121 + + + + + + 180 

GGGAAGCGGTTACCGAACGTCGTGCGCGTCCCCCTCCGCCACCCCTGTTTTGAGTGTGTA 
LRQWLAARAGGGGGDKTHTC- 

GTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAA 

181 ♦ + + + + 240 

CAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTT 
PPCPAPELLGGPSVFLFPPK- 

AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 

241 + + + ♦ + + 300 

TTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGC 
PKDTLM I SR T PEVTCVVVDV- 

TGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATA 

301 + + + ♦ + + 360 

ACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTAT 
S HED PEVKFNWYVDGVEVHN- 

ATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCC 

361 + + * ♦ + + 420 

TACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGG 
AKTK PREEQYNSTYRVV 3VL- 

TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 

421 + + ♦ + + .+ 480 

AGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGT 
TVLHQDWLNGKEYKC KV SNK* 

AAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGACAAC 

491 + + + • - + + + 540 

TTCGCTOIGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTC 

AL.PAPZERTI9KAKGQPREP* 

CACAGCTGTACACCCTGCC<;CCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGA 

541 ♦ + + + + + 600 

GTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACT 
QVY T L P PS RDE LT KNQV 3 L T - 

CCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGC 

601 + + ♦ + + ♦ 660 

GGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCG 
CLVKGFYPSDIAVEWESNGQ- 

AGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCC 

661 ♦ ♦ + * + + 720 

TCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGG 

PENNYKTTPPVLD3DGSFPL- 

TCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCT 

72i + + + ♦ + + 780 

AGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAA^AGTACGA 
Y SKLTVDKSRWQQGNVF.SC 3- 

CCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

781 + + + + ♦ ♦ 840 

GGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCC 
VMHEALHMHYTQKSLSLS P G • 

BamKZ 

I 

GTAAATAATGGATCC 

841 ♦ 855 

CATTTATTACCTAGG 
K * 
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FIG. 10 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR* 

GTCAGTGGCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAAACTCACACATGTCCAC 

61 + ♦ + + ♦ ♦ 120 

CAGTCACCGACCGACGAGCACGACCACCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTG 
QWLAARAGGGGGDKTHTCPP- 

CTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCA 

121 + + * ♦ + + 180 

GAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGT 
C PAPELLGG PSVFLF P.PKPK- 

AGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCC 

181 + + ♦ ♦ + + 240 

TCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGG 
DTLMI S RT PEVTCVVVDVSH- 

ACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCA 

241 + + + + + + 30O 

TGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGT 
ED PEVKFNWYVDGVEVHNAK- 

AGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCG 

301 + ♦ ♦ + + + 360 

TCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGC 
TKPREEQYNSTYRVVSVLTV- 

TCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCC 

361 + + + ♦ - ♦ 420 

AGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGG 
LHQDWLNGKEY KCKVSNKAL- 

TCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGG 

421 + + + + ♦ - -+ 480 

AGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCC 
PAPIEKTISKAKGQ P R E P Q V - 

TGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCC 

481 + + + + ♦ + 540 

ACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGG 
YTLPPSRDELTKNQVSLTCL- 

TGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGG 

541 + + + + + + 600 

ACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCC 
V KG PY P S D I A VEWE S NGQ P E - 

AGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACA 

601 + + + + + ♦ 660 

TCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGT 

NNYKTT PPVLDSDGS P PLY S- 

GCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGA 

661 + ♦ + + ♦ + 720 

- CGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACT 

K LTVDKSRWQQGNVPSC S V M - 

TGC ATGAGGCTCTGC AC AAC C ACT AC ACGC AG AAGAGC CTC TCCCTGTCTC C GGGTAAAT 

72i + ♦ + + ♦ + 780 

ACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTA 
HEALHNHYTQKSLSLS P G K * - 

BamHI 
I 

AATGGATCC 

781 789 

TTACCTAGG 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24782 



11/37 



PCT/US99/25044 



FIG.11 



E 
o 

X 

j— 

Q. 




0 1 



4 



5 6 7 8 9 
Days Post Treatment 



10 11 12 13 14 



• Carrier 

o PEGMGCF 
tTMP 
v TMPTMP 

■ PEGTMPTMP 

□ TMETMPFc dimer 

♦ FcTMPTMP dimer 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24782 



12/37 



PCT/US99/25044 



FIG.12 



10000 
9500 
. 9000 
8500 
8000 
7500 
7000 
«, 6500 
° 6000 
5500 
5 5000 
f 4500 
°- 4000 
3500 
3000 
2500 
2000 
1500 
1000 
500 

0 




1 I I I l l l l l l J l I l l I l l I l 

2 3 4 5 6 7 8 9 1011 1213 141516 17 18 1920 21 



10000 
9500 
9000 
8500 

8000 
7500 
7000 
6500 
6000 
5500 
5000 
4500 
4000 
3500 
3000 
2500 
2000 
1500 
1000 
500 



0 



Days Post Treatment 



• Carrier 

o PEG - MGDF 

▼ TMPTMPFcdimer 

y._FcTMPTMPdimer 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24782 



13/37 



PCTAJS99/25044 



FIG. 13 



Xbal 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + ♦ + + + + 60 

AG ATC T AAAC AAAATTG ATT AATTTC CTCCTT ATTGT AT AC CTGTTTTGAGTGTGTACAG 

MD KTHTC P - 
CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + ♦ + + + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRT PEVTC V VVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTCGACGGCGTGGAGGTGCATAATG 

181 + + + ♦ + + 240 

CGGTCCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + + + ♦ + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLNGKEYRC K V S N K A - 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
LPAPIERTI SKAKGQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + + + + + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTL P PSRDELTKNQVS LTC* 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 v ♦ + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 

LVKGPYPSDIAVBWESNGQP- 
CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTPPVLDSDGS PFLY* 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 ♦ + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVPSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + ♦ + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLSLS PGR- 

AAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTT 

72i ♦ • • - + + + + * 780 

TTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAA 
GGGGGGGTY SC H'PG PLTWVC- 

* 

BamHI 

I 

GCAAACCGCAGGGTGGTTAATCTCGTGGATCC 

781 + + + -- 812 

CGTTTGGCGTCCCACCAATTAGAGCACCTAGG 
K P Q G G * 
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FIG. 14 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGGAGGTACTTACTCTTGCC 

1 + ♦ + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCCTCCATGAATGAGAACGG 

MGGTYSCH- 

ACTTCGGCCCGCTGACTTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGGACA 

61 ♦ + + ♦ + ■»■ 120 

TGAAGCCGGGCGACTGAACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCCTGT 
FG PLTWVC KPQGGGGGGGDK- 

AAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCC 

121 + ♦ • • - + + + + 180 

TTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGG 
THTCPPCPAPELLGGPSVFL- 

TCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCG 

181 + ♦ + ♦ + + 240 

AGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGC 
F P PKPKDTLMI SRTPEVTCV- 

TGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCG 

241 + + + + + + 300 

ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGC 
VVDVS HEDPEVKFNWYVDGV- 

TGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTG 

301 + + + + ♦ ♦ 360 

ACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCAC 
EVHNAKTKPREBQYN9TY RV- 

TGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCA 

361 + ••• • + + ♦ + ♦ 420 

ACCAGTCGCAGGAGTGGCAGGACGTGCTCCTGACCGACTTACCGTTCCTCATGTTCACGT 
VSVLTVLHQDWLNGKEY KCK- 

AGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGC 

421 + + 4- + + t 480 

TCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTT^ 

VSNKALPA PIEKTI S KAKGQ- 

AGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACC 

481 ♦ + + + • 540 

TCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGG 
PREPQVYTLPPSRDELTKMQ- 

AGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGG^ 

541 + + + + + + 600 

TCCAGTCGGACTGCACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCC 
VSLTCLVKGFYPSDIAVBWE- 

AGAGCAATGGGCACCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACG 

601 + + ♦ + " + * *•+ 660 

TCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGC 

S N G Q PENNY KTTP PV LD SDG* 

GCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACG 

6gl + + + ♦ + + 720 

CGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGC 
3 F F h Y 3 K L T V D K 3 R W Q a G-N V- 

TCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCT 

721 + + + + + + 780 

AGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGA 
FSCSVMHEALHNHYTQKSLS- 

BaxnKI 
I 

CCCTGTCTCCGGGTAAATAATGGATCC 

781 ♦ + 807 

GGGACAGAGGCCCATTTATTACCTAGG 
L S P G K • 
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FIG. 15 

i 

TCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATGGGAGGTACTTACTCTTO 

1 + «- + + + + 60 

AGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATACCCTCCATGAATGAGAAC 
b MGGTYSC 

CCACTTCGGCCCACTGACTTGGGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGG 

61 + +■ ♦ ♦ ♦ + 120 

GGTGAAGCCGGGTGACTGAACCCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACC 
b H FG PLTWVCKPQGGGGGG'GG 

TACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGG 

121 + ♦ ♦ + + ♦ 180 

ATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCC 
b TYSCHFGPLTWVCKPQGGGG- 

AGGCGGGGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGG 

181 ♦ + + * + + 240 

TCCGCCCCCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCC 
b GGGDKTHTCPPCPAPELLGG- 

ACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCC 

241 + + ♦ + «■ '-+ 300 

TGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGG 
b PSVFLPPPKPKDTLMISRTP- 

TGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTG 

301 + + + + + 360 

ACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGAC 
b EVTCVVVDVSHBDPEVKFMW- 

GTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAA 

361 + + ♦ * + + 420 

CATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTT 

b YVDGVEVHNAKTKPREEQYN- 

CAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAA 

421 + + ♦ + + - + 480 

GTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTT 
b STYRVVSVLTVLHQOWLNGK 

GGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTC 

48i ♦ ♦ * + — + 540 

CCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAG 
b EYKCKVSNKALPAPIEKTIS- 

CAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGA 

541 + + + + + + 600 

GTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACT 
b KAKGQPREPQVYTLPP9RDE 

GCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACAT 

601 + + ♦ ♦ + ♦ 660 

CGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTA 

b LTKNQVSLTCLVKGFYP3 D I - 

CGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGT 

661 + + ..- + + + + 720 

GCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCA 
b AVBWESNGQPBMNYKTTPPV. 

GCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTG 

721 + + + ♦ ♦ + 780 

CGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCAC 
b LDS DGSFFLYSKLTVDKSRW- 

GCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACAC 

781 + + + + + + 840 

CGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTG 
b QQGNVFSCSVMHEALHNHYT- 

BamHI 
I 

GCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

841 ♦ + + ; 881 

CGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

b QKSLSLSPGK* 
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FIG. 16 



Xbal 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 - - + + + -~ + ♦ - - - - + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACC^ 

K O K T H T C P- 

CACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCC^ 

61 ♦ + + + + -•--+ 120 

GTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTO 
PC PA PELLGGPSV FLFP P K P - 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTCGACGTGA 

121 -f ♦ + + + + 180 

GCTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 

KDTLMI S RTPBVTCVV-VDVS* 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + + + + + + 240 

C GGTGCTTCTGGGACTCCAGTTC AAGTTGAC C ATGC ACCTGCCGC ACCTCC ACGTATTAC 
HEDPEVKFNWYVDGVEV H N A - 

C C AAGAC AAAGCC GC GGG AGG AGC ACT AC AAC AGCACGT AC C GTGTGGTC AGC GTCCTCA 

241 ♦ + + ♦ ♦ + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + .--+=-- + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLNGKEYKC KVS N K A - 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + ♦ + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
LPAPIEKTISKAKGQPRE P Q - 

AGGTGTACACCCTGCCTCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + + + + + + 480 

TCCACATGTGGGACGGAGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPPSRDELTKNQVS LTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGCCAGC 

481 + ♦ + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGPYPSDIAVEWESNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GCC TCTTGTTGATGTTCTGGTGCGG AGGGC ACGACCTG AGGCTGCC GAGG AAGAAGGAG A 
ENNYKTTPPVLDSDGS F FLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 ♦ ♦ + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSC S V - 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 ♦ + + ♦ + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLS LS P G K - 

AAGGTGGAGGTGGTGGCGGAGGTACTTACTCTTGCCACTTCGGCCCACTGACTTGGGTTT 

721 + + + + + + 780 

TTCCACCTCCACCACCGCCTCCATGAATGAGAACGGTGAAGCCGGGTGACTGAACCCAAA 
GGGGGGG'TYSCHFGPL T~W V C • 

GCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGC 

781 + * + + ♦ + 840 

CGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCG 
K PQGGGGGGGGTYSCHFG P L - 

BaraHI 
I 

TGACCTGGGTATGTAAGCCACAAGGGGGTTAATCTCGAGGATCC 

841 ♦ ♦ + + 884 

ACTGGACCCATACATTCGGTGTTCCCCCAATTAGAGCTCCTAGG 

TWVCKPQGG* 
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FIG. 17A 

(AafcH sticky end] 5 1 GCGTAACGTATGCATGGTCTCC - 

(position #4358 in pAMG21) 3' TGCACGCATTGCATACGTACC AGAGG - 

- CCATGCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACT - 

- GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA - 

* 

- GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC - 

- CCCGGAAAGCAAAATAGACAACAAACAGCC ACTTGCGAGAGGACTCATCCTGTTTAGGCG - 

- CGGGAGC GGATTTGAACGTTGCGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGC - 

• GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCCACCGCCCGTCCTGCGGGCG - 

- CATAAACTGCCAGGCATCAAATTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGT - 

- GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA - 

AflLLH 

- TTCTACAAACTCTTTTGTTTATTTTTCTAAATACATTCAAATATGGACGTCGTACTTAAC - 

- AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG • 

• TTTTAAAGTATGGGCAATCAATTGCTCCTGTTAAAATTGCTTTAGAAATACTTTGGCAGC • 

- AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG - 

- GGTTTGTTGTATTGAGTTTCATTTGCGCATTGGTTAAATGGAAAGTGACCGTGCGCTTAC - 

- CCAAAC AACATAACTCAAAGTAAACGCGTAACCAATTTACCTTTCACTGGCACGCGAATG * 

- TACAGCCTAATATTTTTGAAATATCCCMGAGCTTTTTCCTTCGCATGCCCACGCTAAAC - 

- ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG • 

- ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTATTTGCTATATTTATTTTTC - 

- TAAGAAAAAGAGAAAACC AATTTAGCAAC AAACTAAATAATAAACGATATAAATAAAAAG - 

- GATAATTATCAACTAGAGAAGGAACAATTAATGGTATGTTCATACACGCATGTAAAAATA - 

- CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT - 

- AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT - 

- TTGATAGATATATCAACAGAAAGAGACTTACACGTTTTGATTCGTAAGGCTTCGGTAATA - 

- TAGC AGTATGAATAGGGAAACTAAACCCAGTGATAAGACCTGATGATTTCGCTTCTTTAA - 

- ATCGTC ATACTTATCCCTTTGATTTGGGTC ACTATTCTGGACTACTAAAGCGAAGAAATT - 

• TTAC ATTTGGAGATTTTTTATTTACAGCATTGTTTTCAAATATATTCCAATTAATCGGTG - 
AATGTAAACCTCTAAAAAATAAATGTCGTAACAAAAGTTTATATAAGGTTAATTAGCCAC - 

AATGATTGGAGTTAGAATAATCTACTATAGGATCATATTTTATTAAATTAGCGTCATCAT - 
TTACTAACCTCAATCTTATTAGATGATATCCTAGTATAAAATAATTTAATCGCAGTAGTA - 

AATATTGCCTCCATTTTTTAGGGTAATTATCCAGAATTGAAATATCAGATTTAACCATAG- 
TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC - 

AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATGTACCATTTTAGTCATATCAG- 
TTACTCCTATTTACTAGCGCTCATTTAl^ATAAGTGTTACATGGTAAAATCAGTAT 

ATAAGCATTGATTAATATCATTATTGCTTCTACAGGCTTTAATTTTATTAATTATTCTGT- 
TATTCGTAACTAATTATAGTAATAACGAAGATGTCCGAAATTAAAATAATTAATAAGACA- 

AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC - 
TTCACAGCAGCCGTAAATACAGAAAGTATGGGTAGAGAAATAGGAATGGATAACAAACAG- 

GCAAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGACATTTGATTCTAATAA- 
CGTTCAAAACGCACAATATATAGTAATTTTGCCATTATCTAACTGTAAACTAAGATTATT- 
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FIG. 17B 



- ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG - 
- TAACCTAAAAACAGTGTGATAATATAGCGAACTTTATGTTAACAAATTGTATTCATGGAC - 

- TAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTATAGTCGATTAATCGATTTGATT - 

- ATCCTAGCATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCTAAACTAA - 

- CTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGA - 

- GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT - 

- GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA - 

- CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT - 

" GAAGAAGAAGAAGAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATA - 

- CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT - 

- ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG - 

- TGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCC - 

- AACCGCTCTTCACGCTCTTCACGC 3' [fiafill sticky end] 

TTGGCGAGAAGTGCGAGAAGTG 5' (position #5904 in pAMG21) 
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FIG.19A 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG " 
GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MD KTHTCP PC P A P E L L GGP ■ 
TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 
6 1 AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 
SVFLFPPKPKDTLMISRTPE 
GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 
1 CAGTGTACGC ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTC AAGTTGACCATG 
VTCVVVDVSHEDPEVKFNWY 
GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

1 1 CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVE VHNAKTKPREEQYNS - 
ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

2 4 TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 
TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATC ^ g q 

3 ° 1 ATGTTCACGTCCAGAGGTTC^ 

YKCKVSNKALPAPIEKTISK • 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGA^ ^ ^ q 

3 6 1 CGGWTCCCGTCGGGGCTCWGGTCTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL - 
ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGC^ ^ 

4 2 1 TGGWCTCGGTCCAGTCGGACTGGACGGACCAGT^ 

TKNQVSLTCLVKGFYPSDIA - 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTC^ ^ 

4 8 1 CACCTCACCCTCTCG^ACCCGTCGGCCTC 

VBWBSMq'qPBNNYKTTP'PVI,-"- 

GACTCCGACGGCTCCTTCTTCCTCTACAGC AAGCTCACCGTGG A^^AGGTGGCAG ^ ^ ^ 

54 1 ctgaggctgJcgaggaa^^ 

DSDGSFFLYSKLTVDKSRWQ - 
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FIG. 19B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTC ^ 
1 GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGT 

QONVFSCSVMHEALHNHYTQ - 
AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTCGTCACTTCCTGCCGCACTAC ^ ^ ^ 

1 TTCTCGGAGAGGGACAGAGGC^^ 



KSLSLSPG 



KGGGGGDFLPHY 



BatnHI 
I 

AAAAACACCTCTCTGGGTCACCGTCCGTAATGGATCC ^ ^ 
721 TTTTTGTGGAGAGACCC AGTGGCAGGCATTACCTAGG 
KNTSLGHRP* 
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FIG. 20A 



Ndel 
I 



C ATATGGACTTCCTGCCGCACTACAAAAACACCTCTCTGGGTCACCGTCCGGGTGGAGGC 

1 + + + + + + 60 

GTATACCTGAAGGACGGCGTGATGTTTTTGTGGAGAGACCCAGTGGCAGGCCCACCTCCG 

MDFLPHYKNTSLGHRPGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

fi , +. + + + + + " u 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP - 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

121 AGTCAA^GGAG^GG^GGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLPPP. KPKDTLMISRTPB - 
GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 
1 CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 
VTCVVVDVSHEDPEVKFNWY - 
GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC ^ 

24 1 CACCTGCCGCACCTCCACG^ 

VDGVEVHN AKTKPRBEQYNS - 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCT^ 3 g q 

3 ° 1 TGCATGGCA^ACCAGTCGCAGGAGTGGCAGGACCT^ 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAAC^ ^ 

3 6 1 ATGTTCACGTTCCAGAGG^GTTTCGGGAGGCT^ 

YKCKVSNKALPAPIEKTISK - 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGC ^ & Q 

4 2 1 CGGTTTCCCGTCGGGGCTCTTGGTCTCCACATGTGG 

AKGQPREPQVYTLPPSRDEL - 

ACCAAGAACCAGGTCAGCCTCACCTGCCTGCT^ ^ 

4 8 1 TGGTTC'TTGGTCCAGTCGGACTGGACGGACCA 

TKNQVSLTCLVKGFYPSDIA - 

gtggagt«;gagagcaatgggcagccggagaacaactacaagacca^ goo 
54 1 cacctcaccctctcgctacccgtcggcctcttc^ 

VEWESNGQPENNYK TTPPVL - 
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FIG. 20B 



GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 + + + + + ...-..-..+ 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

661 + + + + + + 720 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 



BamHI 
I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCCGCGG 

72 i + + + + - 761 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGGCGCC 

KSLSLSPGK* 
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FIG.21A 



Ndel 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

^ + + + + + - + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

g ^ ...... ... + -..--««-- + •- •- -- -- • + ••■•'"""•" + """"""■"* + ""*■'"" " * * 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + T--- + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 



V T 



CVVVDVSHEDPEVKPNWY 



GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 
181 CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 
241 TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + ** + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 
GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 CGG^TCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 
AKGQPREPQYYTLPPSRDEL 
ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC ^ 

421 TGGTTCCTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 
TKNQVSLTCLVKGFYPS DIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 
. oi + -- + -'- + + " - ~- " **■ + 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 
VEWESNGQPENNYKTTPPVL - 
GACTCCGACGGCTCCTTCTTCCTCTACAGC AAGCTCACCGTGGACAAGAGC AGGTGGCAG 
1 CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 
DSDGSFF LYSKLTVD.KSRWQ • 
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FIG.21B 



CAGGGGAACGTCTTCTCAT^TCCGTGATGCATGAGGCTCTGCACAACCA^^ ^ 
1 GTCCCCTTGC^GAAGAGTAWAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ - 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTC ? 2 Q 

661 + ■•■"_:: " l qqcCCATTTCCACCTCCACCACCAAAGCTTACCTGGGGCCCA 



TTCTCGGAGAGGGACAGA 
K S 



LSLSPGKGGGGGFBWTPG 



BamHI 

TACTGGCAGCCGTACGCTCTGCCGCTGTAATGGATCCCTCGAG ? g ^ 
721 ATGACCGTCGGCATGCGAGACGGCGACATTACCTAGGGAGCTC 

YWQPYALPIi* 
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FIG. 22A 



Ndel - - - - - - 

CATATGTTCGAATGGACCCCGGGTTACTGGCAGCCGTACGCTCTGCCGCTGGGTGGAGGC 

I + + + + + + 60 

GTATACAAGCTTACCTGGGGCCCAATGACCGTCGGCATGCGAGACGGCGACCCACCTCCG 

MFEWTPGYWQPYALPLGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

gl + + + + + + 120 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

12 x + + + + + + 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 



S V 



FLF PPKPKDTLMISRTPE 



GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

^g^ + + + + + + 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 



V T 



CVVVDVSHEDPEV KFNWY 



GTGGACGGCGTGGAGGTGC ATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 
241 CACCT^CG^CCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 
VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

10 i + + + + + + 360 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVIiHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

361 ATGTCCACGTTC^ 



Y K C 



KVSNKALPAPIEKTISK 



GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 
421 CGGOTTCCCGTCGGGGCTCTTGGTGTCC ACATGTGGGACGGGGGTAGGGCCCTACTCGAC 



A K G Q 



PREPQVYTLPPSRDEL 



ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

401 + + + + + + b 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVS LTCLVKGFYPSDIA 
GTGGAGTGGGAC^GC AATGGGC AGCCGGAGAACAACTAC^GACCAC g q q 

541 CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTC^^ 

VEWESNGQPENNYKTTPPVL - 
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FIG. 22B 



GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 + + + + + + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

661 + + + + " ' + + 720 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

BamHI 
I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 - + + + 757 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

KSLSLSPGK* 
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FIG. 23A 



Ndel 

CATATGGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCG 

1 + + + + + - + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

51 + + + + + + 120 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMI SRTPE. 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

I2i + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + -.-- + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKP. REEQ YNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 



T Y 



RVVSVLTVLHQDWLN GKE 



TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + * + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

+ + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 



A 



KGQPREPQVYTLPPSRDEL 



ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + -.+----- + 480 

TGGTTCTTGGTCC AGTC GGACTGGACGGACC AGTTTC CGAAGATAGGGTCGCTGTAGCGG 



T 



KNQVSLTCLVKGFYPSDIA 



GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

c^i + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ - 
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FIG. 23B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + + + + ♦ 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGTGGTGGTGGTGTTGAACCGAACTGTGAC 

661 + + + + + -• + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCACCACCACCACAACTTGGCTTGACACTG 

KSLSLSPGKGGGGGVEPNCD 

BamHI 
I 

ATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTGTAACTCGAGGATCC 

721 + + + + --.+--- 773 

TAGGTACAATACACCCTTACCCTTACAAAACTTGCAGACATTGAGCTCCTAGG 

I HVMWEWECFERL * 
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FIG. 24A 



Ndel " - - 

I 

CATATGGTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGT 

1 + + + + + + 60 

GTATACCAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCA 

MVEPNCDIHVMWEWECFER 

CTGGGTGGTGGTGGTGGTGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTC 

61 + + + - - - + + + 120 

GACCCACCACCACCACCACTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAG 

LGGGGGDKTHTCPPCPAPEL 

CTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCC 

121 + + + + + + 180 

GACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGG 

LGGPSVFLFPPKPKDTLMIS 

CGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAG 

181 + + + + + + 240 

GCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTC 

RTPEVTCVVVDVSHEDPEVK 

TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAG 

241 + + + + + + 300 

AAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTC 

FNWYVDGVEVHNAKTKPREE 

CAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG 

301 + + + + + + 360 

GTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGAC 

QYNSTYRVVSVLTVLHQDWL 

AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAA 

361 + + + + + + 420 

TTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTT 

NGK EYKCKVSNKAIiPAPIEK 

* 

ACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCC 

421 + + + + + + 480 

TGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGG 

TISKAKGQPREPQVYTLPPS 

CGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGXCAAAGGCTTCTATCCC . 

481 + + + + + + '540 

GCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGG 

RDELTKNQVSLTCLVKGFYP 

AGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACG 

541 + + + + + + 600 

TCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGC 

SDIAVEWESNGQPENNYKTT 
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FIG. 24B 



CCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAG 

601 + + + + •- + + 660 

GGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTC 

PPVLDSDGS FFLYSK LTVDK 

AGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAAC 

661 + + + + + + 720 

TCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTG 

SRWQQGNVFSCSVMHEALH N 

BamHI 
I 

CACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAACTCGAGGATCC 

721 + + + + + 773 

GTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTGAGCTCCTAGG 

HYTQKSLSLSPGK* 
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FIG. 25A 



Ndel 
I 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 + + + + + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMI SRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATfiTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + +. + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQD WLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + --- + +------ + ..-.-..-.+ 5.40 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 
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FIG. 25B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + - + * .-.+...--....+ 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTGCACCACCCACTGGGGT 

661 + + + + + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAACGTGGTGGGTGACCCCA 

KSLSLS PGK GGGGGCTTHWG 

BamHI 
I 

TTCACCCTGTGCTAATGGATCCCTCGAG 

721 + + "* 748 

AAGTGGGACACGATTACCTAGGGAGCTC 



F T L C 



* 
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121 



181 



Ndel 

CATATGTGCACCACCCACTGGGGTTTCACCCTGTGCGGTGGAGGCGGTGGGGACAAAGGT 

I + + + + + + 60 

GTATACACGTGGTGGGTGACCCCAAAGTGGGACACGCCACCTCCGCCACCCCTGTTTCCA 

MCTTHWGFTLCGGGGGDKG 

GGAGGCGGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGG 

61 + + + + + + 120 

CCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCC 

GGGGDKTHTCPPC PAPELLG 

GGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACC 

+ + + + + + 180 

CCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGG 

GPSVFLFPPKPKDTLMISRT 

CCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAAC 

+ + + + + + 240 

GGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTtCAAGTTG 

PEVTCVVVDVSHEDPEVKFN 

TGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTAC 

241 + + + + + + 300 

ACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATG 

WYVDGVEVHNAKTKPREEQY 

AACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGC 

301 + + + + + + 360 

TTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCG 

NSTYRVVSVLTVLHQDWLNG 

AAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATC 

+ + + + + + 420 

TTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAG 

KEYKCKVSN K A LPAPIEKTI 
TCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGAT 
AGG^TTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTA 
SKAKGQPREPQVYTLPPSRD 

GAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGAC 

4Q1 .;...--.-+- + + +.---- — - -+_- + 540 

CTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTG 

EL .TKNQVSLTCLVKGFYPSD - 

ATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCC 

TAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGG 

IAVEWESNGQPENNYKTTPP 
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GTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGG 

g 0 l + + + + + + 6 fi0 

CACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCC 

V LDSDGSPFLYS KLTVDKSR 

TGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTAC 

661 + + + + + + 720 

ACCGTCGTCC CCTTGC AG AAGAGTACGAGGC ACTACGTACTC CGAGACGTGTTGGTGATG 

WQQGNVFSCSVMHEALHNHY 

BamHI 
I 

ACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 ---- + ♦.....---- + -... + .-. 763 

TGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

TQKSLSLSPGK* 
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